PRAISE for The Literacy Crisis


This is the clearest argument for the absolute necessity of engaging children in 
literate activity—reading and writing—since Frank Smith’s Understanding Reading 
was published 25 years ago. McQuillan carefully analyzes a wide variety of research 
to show how commonly observed effects are narrowly interpreted and misinterpreted
by authors seemingly blinded by their existing biases toward particular
views of how children learn to read. He builds a powerful case for continuing to 
refine and improve modern literacy instruction by enhancing all children’s access 
to appropriate, engaging texts but argues persuasively that it is unequal access that 
produces the inequitable literacy outcomes that bedevil our schools and society. 
Addressing this inequity of access would truly represent an education reform 
initiative based in reliable, replicable research. 

—Dick Allington, author of No Quick Fix: Rethinking Literacy Programs in 
America’s Elementary Schools

Are you disturbed by endless claims about the “crisis” in beginning reading and the 
need to teach sounds and skills to solve it? Want a keenly argued, informed answer to 
these claims? Want a solidly researched alternative vision of what genuinely counts in 
successful reading instruction? Want “real solutions”? Then read this book! 

—Gerald Coles, author of The Learning Mystique: A Critical Look at Learning 
Disabilities 

In this book Jeff McQuillan provides the hard, statistical evidence critics of literature- 
based approaches to teaching reading have been demanding. He debunks studies 
that purport to show the superiority of phonics-based approaches to teaching 
reading, and argues that access to interesting books is the key to reading success. 

—David and Yvonne Freeman, authors of Between Worlds: Access to Second 
Language Acquisition

[McQuillan’s book] is without question one of the most important books on 
literacy ever written. It represents a major breakthrough in our understanding of 
the “literacy crisis.” 

—Stephen Krashen, author of Every Person a Reader: An Alternative to the 
California Task Force on Reading 

Through careful analysis of current research, McQuillan calls into question the 
current overfocus on isolated skills and phonics and demonstrates that access to print 
and the amount of reading students do are the best predictors of reading achievement.
As knowledgeable teachers, we must heed his findings and provide students
with engaging materials, well-stocked libraries, and lots of opportunities to read. 

—Regie Routman, author of Literacy at the Crossroads: Crucial Talk About Reading, 
Writing, and Other Teaching Dilemmas 



If we take the Star Wars trilogy as a metaphor for the struggle over literacy in this 
country, The Reading Excellence Act, the Texas and California Legislative fiats, and 
the new phonically rich basals represent the Empire Striking Back. With this book, 
Jeff McQuillan announces the Return of the Jedi. The force is with him. 

—Patrick Shannon, author of Reading Poverty 

This book is a “must-have” resource for any administrator, especially those who are 
challenged with misguided and misinformed demands from the public sector. 
Research citations and rebuttals are worth the price of admission to this publication. 

—Judie Thelen, Past-President of the International Reading Association, President 
of Allegany County (MD) Board of Education 

McQuillan offers all the evidence needed to stop the silly reading wars, if the 
generals will only READ IT. To paraphrase that campaign slogan: “It’s the print 
environment, stupid!” That is, the more books, the higher the scores; fewer books, 
lower scores. Every school board member voting on a library issue should be forced 
to read this book before casting a vote. 

—Jim Trelease, author of The Read Aloud Handbook 

By carefully analyzing one research study after another, McQuillan demolishes 
some popular myths about literacy education, including the myth that whole 
language teaching has caused declining test scores, and the myth that intensive, 
systematic phonics is the answer to improving children's reading. Instead, 
McQuillan demonstrates that what may be THE key factor in reading achievement 
is not the teaching method, but ready access to books that children will love—and, 
of course, the time and opportunity to read them. This book should be required 
reading for everyone who has the opportunity to affect public and school policy: 
parents, community members, educators, and certainly political officeholders. 

—Constance Weaver, author of Reading Process and Practice: From Socio- 
Psycholinguistics to Whole Language, Second Edition 

Three cheers for Jeff McQuillan! For presenting an impressive array of evidence to 
debunk the myths that kids can’t read today, that dyslexia is rampant, and that 
whole-language advocates deny the importance of research. This powerful, 
important book demolishes the claims, so prevalent today, that phonemic awareness
and phonics instruction are central to learning to read, and shows that simple
ingredients like time spent reading and quality libraries are what most help 
children become literate. 

—Sandra Wilde, author of What's a Schwa Sound Anyway? A Holistic Guide to
Phonetics, Phonics, and Spelling 
What Isn't Wrong with Reading:
Seven Myths about Literacy 
in the United States 


The simple truth is that the American public school system is slowly crippling
the country by destroying the brains of its youngest citizens. . .
America is literally losing its brains. 

—Samuel Blumenfeld (1995, p. 196) 

Reading failure is epidemic. Declining test scores in reading have been noted 
in many states, most visibly in California, where the inability of children to 
read, spell, and write well has become a matter of widespread concern. 

—Orton Dyslexia Society (1997, p. 2) 

We are turning into a nation of illiterates.

—Stephen Pinker (McGuinness, 1997, p. ix) 


Serious problems exist with reading achievement in many United States schools.
Teachers are frustrated, parents puzzled, and children themselves discouraged 
over low reading proficiency. I have talked with teachers who tell me of such 
difficulties at all levels, and I witnessed them with my own eyes both when I worked 
as a public school teacher and when I visit schools today. 

Yet in the midst of media coverage of our (latest) “literacy crisis,” we should be 
very clear about what is and is not failing in our schools. Ironically, a great deal of 
the accepted wisdom about how United States students are performing academically 
is, in fact, false, which makes getting at the real problems our children face all the 
more difficult. The best evidence we have on the reading crisis is quite sobering: No
crisis exists on average in United States reading, at least not of the sort reflected by 
the quotes above. However, other serious problems related to literacy are routinely 
ignored. If we are to understand how best to help those who truly need it, we must 
first determine what isn’t true about the current status of reading achievement in
America. Below are what I consider to be seven of the most prevalent, and damaging,
myths about literacy achievement in the United States. Some of these have been
more fully discussed by researchers elsewhere (see especially Berliner & Biddle, 1995; 
and Bracey, 1997a). 


Myth 1: Reading Achievement in the United States 
Has Declined in the Past Twenty-five Years 

What is striking about reading achievement scores in the United States over the past 
twenty-five years is not how they’ve declined, but just how stable they have been.
Several researchers have reviewed and published detailed analyses of the state of
reading in the United States, and nearly all of them have come to the same conclusion:
Children in the United States are reading as well now as they did a generation ago, 
and maybe better (Berliner & Biddle, 1995; Bracey, 1997a). 

The best evidence on reading achievement in the United States comes from a 
national system of examinations established back in the late 1960s by the federal
government to determine how United States schoolchildren were performing in a
variety of school subjects, especially reading and mathematics. These exams, collectively
known as the National Assessment of Educational Progress, or the NAEP, are
important barometers of educational achievement. Unlike commercial tests given by many
school districts, such as the Comprehensive Test of Basic Skills or the Iowa Test of 
Basic Skills, the NAEP is given nationally to a representative sample of United States 
children, so that we can extrapolate more confidently from its results just how well 
the nation as a whole is doing. 

Table 1.1 shows the reading scores of United States children at three age levels
from the first year the NAEP was administered, 1971, to the most recent
administration in 1996. The test is scored on a scale of 0 to 500. It is immediately apparent
that, despite a few minor fluctuations, reading achievement has either stayed even 
or increased slightly over the past three decades. No decline whatsoever is indicated 
at any level. 

Other indicators of reading achievement show similar stability or growth,
including commercial tests at all levels, from the Iowa Test of Basic Skills to the
Graduate Record Examination (see Berliner & Biddle, 1995, for details). Even very young
children appear to be doing better than their age peers of two decades ago. Mayer
(1997) reports that a 1992 national sample of five-to-seven-year-olds scored 5.1
points higher on the reading portion of the Peabody Individual Achievement Test
(PIAT) than the same age children in 1970.¹
Table 1.1

Average Reading Proficiency of 9-, 13-, and 17-Year-Olds
in the United States, 1971-1996


AGE         1971   1975   1980   1984   1988   1990   1992   1994   1996

9 years      208    210    215    211    212    209    211    211    212
13 years     255    256    259    257    258    257    260    258    259
17 years     285    286    286    289    290    290    290    288    287


Source: Campbell, Reese, O’Sullivan, and Dossey (1996), p. 106; Campbell, Voelkl, and
Donahue (1997), p. 104.
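
The stability visible in Table 1.1 can be checked with a few lines of arithmetic. The sketch below copies the table's values and reports, for each age group, the net change from 1971 to 1996 and the gap between the lowest and highest average. The names SCORES, net_change, and spread are chosen here for illustration only.

```python
# Average NAEP reading scores copied from Table 1.1 (scale of 0 to 500).
YEARS = [1971, 1975, 1980, 1984, 1988, 1990, 1992, 1994, 1996]
SCORES = {
    "9 years":  [208, 210, 215, 211, 212, 209, 211, 211, 212],
    "13 years": [255, 256, 259, 257, 258, 257, 260, 258, 259],
    "17 years": [285, 286, 286, 289, 290, 290, 290, 288, 287],
}


def net_change(scores):
    """Difference between the last and the first administration."""
    return scores[-1] - scores[0]


def spread(scores):
    """Gap between the highest and the lowest average in the series."""
    return max(scores) - min(scores)


for age, scores in SCORES.items():
    print(f"{age}: net change {net_change(scores):+d}, spread {spread(scores)}")
```

Each age group ends the period above its 1971 average, and no series moves by more than 7 points on the 500-point scale.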


Myth 2: Forty Percent of United States Children 
Can’t Read at a Basic Level 

There is no question that some children in the United States cannot read very well. 
But is it true, as some politicians and researchers have stated, that nearly half of our 
children are below a “basic” level in reading (McGuinness, 1997)? The claim has its 
origin in the way in which the United States Department of Education now has
decided to report their biennial NAEP results, reports which themselves need some
interpreting. During the early years of the NAEP tests, the Department released only
the raw scores for each age level on its 0 to 500 scale, with no designations of what
score was thought to constitute “basic” or “proficient” levels of reading. The designers
of the NAEP test later decided, however, that simply reporting the raw scores was
no longer adequate in order to judge the progress of United States schools. To give
some supposed meaning to these scores, the Department decided it would determine
how well students were reading by establishing what some have interpreted to be
“objective” cutoff points, indicating the minimum score constituting “below basic,”
“basic,” “proficient,” and “advanced” reading for each age level tested.² The basic level
for fourth-grade reading, for example, was fixed at a score of 208. In 1994, 40 percent
of United States children scored below the “basic” cutoff of 208.

The problem with this approach lies in “objectively” determining where these 
cutoff points should be. Is it really possible to come up with a minimal level of
competency in reading? One of the country’s leading researchers and theorists in the area
of educational statistics, Gene Glass, thinks not. Glass (1978), after reviewing the
various methods proposed for creating “minimal” criterion scores of performance,
concluded that all such efforts are necessarily arbitrary, amounting to little more than
“pseudoquantification” (p. 239). One reason for such pessimism is not difficult to 
understand: One person’s “below basic” is the next person’s “proficient.” It might be 
argued, though, that “experts” would certainly agree on what students should know 
at a given age, and indeed the NAEP levels were based on “collective judgements, 
gathered from a broadly representative panel of teachers, education specialists, and
members of the general public” (Campbell, Donahue et al., 1996, p. 6). But what if 
Expert Panel A disagrees with Expert Panel B? And even if standards are agreed upon, 
the actual implementation of them will fall to a group of test writers, each of whom 
may write more or less difficult items. Glass noted that no amount of mathematical 
wizardry can make this problem of arbitrariness go away. Indeed, any teacher knows 
that you can make a test so easy that everyone passes or so difficult that everyone fails.
Of course, such arbitrary cutoff points already exist in education and many other
fields, but at least they are recognized as arbitrary, and not given the status of
absolute or objective levels of competence.

The fallacy of obtaining “objective” definitions of “basic” reading abilities is
illustrated by the real-world attempts to implement them. One may be tempted to
think, for example, that while truly “objective” measures of competence are not
possible, surely we can get those knowledgeable about reading to reach some common
understanding about what a “basic” reading level looks like at a given grade level, 
right? One answer to this question is found in Table 1.2, which shows the percentage 
of students in 14 states who were judged to be “proficient” in reading at the third- 
and fourth-grade levels according to two different tests. In the first column is the 


Table 1.2

Percentage of 3rd/4th Grade Students “Proficient” in Reading, 1994-95

STATE             % PROFICIENT     % PROFICIENT         DIFFERENCE BETWEEN
                  ON NAEP EXAM     ON STATE EXAM        STATE AND NATIONAL
                  (4TH GRADERS)    (3RD/4TH GRADERS)    ESTIMATES

Connecticut            38               48                   10
Delaware               23               11                   12
Georgia                26               67                   41
Indiana                35               66                   31
Kentucky               26               28                    2
Louisiana              15               88                   73
Maryland               26               34                    8
New Hampshire          36               26                   10
North Carolina         30               65                   35
Rhode Island           32               65                   33
South Carolina         20               82                   62
Tennessee              27               62                   35
Texas                  26               79                   53
Wisconsin              35               88                   53


(from “Comparing Test Results,” Education Week, January 15, 1997)
percentage as determined by the 1994 NAEP test; in the second column, the individual
state’s estimate based on its own choice of a standardized reading test. Note
that most states established their cutoff scores based on the advice of their own expert
panel, each no doubt as qualified as the next. Yet in almost every case states came
up with different results, sometimes dramatically so, in the number of students
who were thought to be good readers. Louisiana, for example, had 88 percent of its
students reading at a proficient level on its own assessment, but only 15 percent on
the NAEP test, a difference of 73 percentage points! Delaware judged only 11 percent
of its students as reading proficiently but showed about twice that (23 percent) as
proficient on the NAEP tests. Wisconsin had only 35 percent of its children score at the
proficient level on the NAEP, but judged an impressive 88 percent proficient on its
own exam. Each exam, each panel, each state came up with a different determination
of a “proficient” reader. Overall, there was an average difference of nearly 33
percentage points between the two estimates of proficient readers.
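
The average difference quoted above can be reproduced directly from Table 1.2. The following sketch copies the two percentage columns and takes the absolute gap for each state; the names TABLE_1_2, gaps, and mean_gap are illustrative and not part of the original report.

```python
# (NAEP % proficient, state exam % proficient), copied from Table 1.2.
TABLE_1_2 = {
    "Connecticut": (38, 48),
    "Delaware": (23, 11),
    "Georgia": (26, 67),
    "Indiana": (35, 66),
    "Kentucky": (26, 28),
    "Louisiana": (15, 88),
    "Maryland": (26, 34),
    "New Hampshire": (36, 26),
    "North Carolina": (30, 65),
    "Rhode Island": (32, 65),
    "South Carolina": (20, 82),
    "Tennessee": (27, 62),
    "Texas": (26, 79),
    "Wisconsin": (35, 88),
}

# Absolute gap in percentage points, regardless of which estimate is higher.
gaps = {state: abs(own - naep) for state, (naep, own) in TABLE_1_2.items()}

mean_gap = sum(gaps.values()) / len(gaps)
largest = max(gaps, key=gaps.get)

print(f"mean gap across {len(gaps)} states: {mean_gap:.1f} points")
print(f"largest gap: {largest} ({gaps[largest]} points)")
```

The gaps sum to 458 points over 14 states, or about 32.7 points per state, which matches the "nearly 33" figure in the text.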

All “minimal,” “basic,” or “proficient” cutoff points on educational tests are 
necessarily relative. Of course, as in any normal distribution of scores, some students 
will be at the top of the curve, some at the bottom, and most right around average. 
The only thing we can really determine from reading tests (or math tests or social 
studies tests) is whether children are doing better or worse than their age peers of 
previous years. And what we know about such performance in reading is that it has 
been completely stable for the past two-and-a-half decades, as noted previously. If 
nearly half of children in the United States today are truly below a “basic” level of 
reading, then that has always been the case, which might make one wonder how the 
nation has managed to survive up to this point. An alternative interpretation, of 
course, is that the NAEP proficiency levels themselves are amiss. 


Myth 3: Twenty Percent of Our Children Are Dyslexic 

Closely related to the previous misconception that 40 percent of our students read 
below the “basic” level is another portentous-sounding figure that has been circulated 
widely in the media and in many academic circles; namely, that 20 percent of United 
States schoolchildren suffer from a “neurobehavioral disorder” known as “dyslexia,” 
often referred to simply as a “reading disability” (Shaywitz et al., 1996). The research 
most often cited to support this claim is drawn from the results of the Connecticut 
Longitudinal Study (CLS), a large-scale project funded in part by the National Institute
of Child Health and Human Development (Shaywitz, Escobar, Shaywitz, Fletcher,
& Makuch, 1992; Shaywitz, Fletcher, Holahan, & Shaywitz, 1992; Shaywitz, Fletcher,
& Shaywitz, 1994; Shaywitz, Shaywitz, Fletcher, & Escobar, 1990). The CLS tracked
over 400 students from kindergarten through young adulthood, periodically measuring
their Intelligence Quotient (IQ), reading achievement, and mathematical abilities,
among other attributes. CLS researchers measured “reading disability” by two methods.
The first is what is known as “discrepancy scores,” which represent the difference
between a child’s actual reading achievement and what would be predicted based
upon his IQ. The idea is that if you have a high IQ but are poor at reading, then something
must be wrong with you. The actual size of the discrepancy used in the CLS
Table 1.3

Children Classified as Reading Disabled in the Connecticut
Longitudinal Study (N = 414)

               GRADE 1    GRADE 2    GRADE 3    GRADE 5

Number            25         32         31         24
Percentage      6.03%      7.76%      7.78%      5.79%

(from Shaywitz et al., 1990, p. 998; S. Shaywitz et al., 1992, p. 147)


studies was that recommended by the United States Department of Education, 1.5 
standard deviations. This 1.5 standard deviations figure was thus their “cutoff” score 
used to determine who was reading “disabled” and who was not. Table 1.3 shows the 
number and percentage of children at four grade levels who were determined to have 
reading disabilities out of a total of 414 children. 

In any given year, then, a little less than 8 percent fall into the category of reading
disabled, using the 1.5 cutoff. Two important things need to be noticed about these
results. First, and most importantly, the 1.5 standard deviations cutoff point is, like all
such cutoff points for minimal competence, arbitrary. We could just as easily have used
1.25 or 1.75 or .5, each producing a different percentage of “neurobehaviorally”
afflicted children; the sky’s the limit. Second, even the 8 percent represented in Table
1.3 have not been shown in this research to be “dyslexic,” if by “dyslexic” we mean a
“neurologically based disorder in which there is unexpected failure to read,” the definition
used by the CLS team (S. Shaywitz et al., 1992, p. 145; emphasis added). This is
because, quite simply, no neurological measures were administered to these particular
children.³ All that can be said from these findings is that around 8 percent of children
in any given year will have a discrepancy of 1.5 standard deviations between their IQ
and reading achievement, at least if they live in Connecticut.
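
How strongly the choice of cutoff drives the count can be sketched with a simple model. The example below assumes, purely for illustration, that discrepancy scores follow a standard normal distribution; the CLS reports do not state this, and the function name share_flagged is hypothetical. It computes the share of children who would fall beyond each of the cutoffs mentioned above.

```python
from statistics import NormalDist


def share_flagged(cutoff_sd):
    """Share of a standard normal population lying more than
    cutoff_sd standard deviations below the mean (an assumed model)."""
    return NormalDist().cdf(-cutoff_sd)


# The cutoffs named in the text: .5, 1.25, 1.5, and 1.75 standard deviations.
for cutoff in (0.5, 1.25, 1.5, 1.75):
    print(f"cutoff {cutoff} SD: {share_flagged(cutoff):.1%} classified as disabled")
```

Under this assumption a 1.5 cutoff flags roughly 7 percent of children, while a .5 cutoff flags more than 30 percent. The same children and the same scores yield very different "prevalence" figures depending only on where the line is drawn.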

The second approach used in classifying “dyslexic” children was to set the cutoff
point for reading disabilities at the 25th percentile on a norm-referenced reading
test regardless of the child’s IQ. In this case, the reading tests came from the
Woodcock-Johnson Psycho-Educational Battery (W-J) (Woodcock & Johnson, 1977).
Using the second-grade data (a total of 415 children), B. Shaywitz et al. (1992) found
63 children who scored below the 25th percentile cutoff (25 of whom also met the
discrepancy criterion, the method described above), or a total of 15.18 percent. Seven
others met the discrepancy criterion but not the percentile cutoff. Adding them to
the total count, we get 70 out of 415, or 16.86 percent. Round it up, and you get 20
percent of United States children suffering from dyslexia.

It is necessary to recall how percentiles are arrived at in standardized tests in 
order to see that this second method of classifying “dyslexia” is no less arbitrary than 
the first. A standardized test like the W-J is typically given to a large sample of students
who (ideally) are representative of the population as a whole that will use the
test (in the case of the W-J, this “norming sample” of children came from 40 different
communities tested back in the mid-70s [Woodcock & Johnson, 1977, pp. 28-29]).
With a large enough representative sample, the scores will generally have something 
close to a “normal distribution,” meaning that the spread of scores from low to high 
resembles the well-known bell curve. The majority of students will score somewhere 
in the middle, with progressively fewer at the top and bottom of the curve. The average
or “mean” score in such a normally distributed, “unskewed” bell curve will be the
point at which 50 percent of the students score above and 50 percent score below.
Similar points along the distribution represent the scores below which a certain
percentage of the sample population falls. The 25th percentile, for example, is the point
below which 25 percent of the students scored. If you give this same test to another
sample of students like the norming sample, you will find, by definition, that
around 25 percent will score below the cutoff score for the 25th percentile.
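
The "by definition" point can be made concrete with a small sketch. It assumes a normally distributed norming population with a mean of 100 and a standard deviation of 15; these values, and the 5-point advantage given to the second population, are hypothetical and are not taken from the W-J manual.

```python
from statistics import NormalDist

# Hypothetical norming population (illustrative values, not W-J norms).
norming = NormalDist(mu=100, sigma=15)

# Score at the 25th percentile of the norming population.
cutoff = norming.inv_cdf(0.25)


def share_below(population, score):
    """Share of a normally distributed population scoring below `score`."""
    return population.cdf(score)


# A new sample drawn from the same population as the norming sample.
same_population = share_below(norming, cutoff)

# A hypothetical sample that reads somewhat better on average.
stronger_population = share_below(NormalDist(mu=105, sigma=15), cutoff)

print(f"cutoff score: {cutoff:.1f}")
print(f"same population below cutoff: {same_population:.1%}")
print(f"stronger population below cutoff: {stronger_population:.1%}")
```

A sample like the norming sample puts 25 percent of its members below the cutoff simply because that is how the cutoff was constructed. A sample that reads better on average puts fewer below it. In neither case does the percentage say anything about the cause of the low scores.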

Now, the CLS researchers found that, in their particular sample, only 15 percent 
of the students fell below the original W-J sample’s 25th percentile score. The only 
conclusion that can be drawn from this result is that the children in the CLS sample 
were not comparable to the 1977 W-J sample and perhaps were somewhat better readers.
But absolutely none of this adds up to the conclusion that 20 percent of United
States children are dyslexic. That’s because there is nothing significant about the 25th
percentile cutoff point used in the CLS (or the 20th or the 30th or the 16th) in dividing
those who do and do not have a “neurobehavioral” disorder, any more than the 1.5
discrepancy score used in the first method. Other researchers using a different criterion
score would have come up with a different percentage. As in the case of the preceding
myth, there is simply no objective way of arriving at such a determination from
reading tests. The best we can say is that a certain percentage of students read below
this arbitrary cutoff at this age. We can then choose to label (or mislabel) this group
in a certain manner, but we have no reason to conclude, at least based on the CLS evidence,
that they suffer from any neurological disorder or other biological defect.⁴,⁵


Myth 4: Children from the Baby Boomer 
Generation Read Better than Students Today 

Some might object that signs of stability over the past two decades are misleading, that 
we must go back even farther than the 1960s and 1970s to observe large drops in reading
achievement, perhaps to the 1950s or 1940s. McGuinness (1997), for example, argues
that children who learned to read between 1955 and 1965 are better readers than
those educated in more recent years. Her evidence comes from a study of adult literacy 
levels, the National Adult Literacy Survey (NALS), which was given by the United States 
Department of Education to a representative sample of United States adults in 1992 
(Kirsch, Jungblut, Jenkins, & Kolstad, 1993). McGuinness notes that those between the 
ages of 35 and 44 at the time of the survey—and hence those who learned to read in 
the mid-1950s to mid-1960s—have higher reading scores than those of subsequent 
generations. This is proof, she claims, that schools are not as good as they once were. 

What McGuinness is proposing here is problematic, to say the least: Can we 
really measure the effectiveness of schools 40 years ago by how well their graduates 
read today? What about the intervening 30 years of reading experience and education?
We should hardly expect the reading proficiency of these adults to remain stagnant
over time. Surely the reading scores of this same group of 35-44-year-olds from
when they were still enrolled in school are better indicators of how well they performed
as children than the NALS test, since fewer intervening variables exist to confound
the results. We do, in fact, have reading achievement scores from a
representative sample of children of this age cohort in the form of the high school 
NAEP scores from 1971 (for those who entered first grade in 1959 and were 38 at 
time of the NALS administration). As can be seen in Table 1.1, their scores are not 
much different (and slightly lower) than more recent graduates, indicating that baby 
boomers were not better readers as they exited schools than are today’s graduates. 

Moreover, an analysis by M. Cecil Smith (1995) of the same NALS data cited by
McGuinness found that when we compare adults of similar educational levels (an important
control, since higher levels of education lead to higher reading scores), those
adults who reached majority age after 1987 (age 19-24 at the time of the survey) actually
have slightly higher proficiency than older readers of similar educational experience
who went through the school system back in the 1940s, 1950s, and 1960s. This superiority
in reading achievement by young adults holds true on most of the literacy measures
used in the study, even when a person’s current reading habits are considered. In
addition, 19- to 24-year-olds are just as likely to be avid readers as their older counterparts,
whether we look at the reading of books, newspapers, or magazines.⁶

Additional evidence against McGuinness’s claim comes from commercial tests 
that, while not given to a representative sample of United States students like the 
NAEP, are normed on a large group of schoolchildren and thus in some sense indicative
of national performance. As Berliner and Biddle (1995) and Bracey (1997a)
point out, the scores on these tests have been stable or rising since the 1950s.⁷ The
results of general intelligence tests, which date back to the early thirties, have also 
been climbing steadily since then (Berliner & Biddle, 1995), making any assertions 
about the superiority of a mythical “golden age” of education in the United States 
somewhat suspect. 


Myth 5: Students in the United States Are Among 
the Worst Readers in the World 

What will come as most surprising to many people is how the United States compares
internationally in reading achievement: Our nine-year-olds ranked second in
the world in the most recent round of testing conducted by the International Association
for the Evaluation of Educational Achievement (IEA); our fourteen-year-olds
ranked a very respectable 9th out of 31. United States schoolchildren are truly world-
class in reading.⁸ Table 1.4 reports the top ten countries at each age level tested, taken
from Elley (1992).

It is important to note the range of scores at the two age levels. The United 
States came in second among nine-year-olds, with the next best-scoring country, 
Sweden, scoring 9 points lower and the fourth ranking nation, France, 16 points 
lower. Among the fourteen-year-olds, the United States ranked ninth out of 31, but
Table 1.4

Average Scores of 9-Year-Olds and 14-Year-Olds in Reading Achievement


9-YEAR-OLDS                        14-YEAR-OLDS

COUNTRY          MEAN SCORE        COUNTRY          MEAN SCORE

Finland             569            Finland             560
United States       547            France              549
Sweden              539            Sweden              546
France              531            New Zealand         545
Italy               529            Hungary             536
New Zealand         528            Iceland             536
Norway              524            Switzerland         536
Iceland             518            Hong Kong           535
Hong Kong           517            United States       535
Singapore           515            Singapore           534


(from Elley, 1992, Tables 3.1 & 4.1)


this is somewhat deceiving, since the United States was virtually tied with 5 other 
countries after fourth-place New Zealand. 

Too Good to Be True? 

A dissenting opinion on just how well United States schoolchildren perform over time 
and internationally is held by Walberg (1996), who argues that reading achievement has 
in fact declined since the early 1970s. Walberg compared the IEA scores from 1990-91 
noted in Table 1.4 to the first IEA test given to 15 nations in 1970, with the scores from 
the two tests equated (Lietz, 1995, cited in Walberg). Walberg concluded that the scores 
did indeed decline, from 602 in 1970 to 541 in 1991 (using his adjusted scores). 

Two problems exist with this analysis, however. First, it is not clear why the two 
IEA tests given 22 years apart should be preferred in measuring trends in United 
States reading performance over the United States Department of Education’s own 
NAEP exam, which has not only been given more frequently (9 times since 1970), but 
was designed to be much more sensitive to a broader range of reading achievement 
(Binkley & Williams, 1996) than the IEA tests. Second, the IEA test has changed considerably
since its first administration in 1970 (Elley, 1994). Unfortunately, the reanalysis
of the scores upon which Walberg bases his comparisons is unpublished,
making it difficult to know precisely how these “equated” scores were derived from 
what were markedly different tests. Elley, Schleicher, and Wagemaker (1994) note 
that common passages were used in the 1970 and 1991 assessments, but there is no 
indication in Walberg’s report of whether the scores being compared were based 
upon these sections of the test. 
Walberg also finds evidence for failing United States performance in a European
study (Organization for Economic Cooperation and Development, 1995) in
which scores of “reading efficiency” were computed for each country that participated
in the IEA’s 1990-91 test administration. This efficiency score comes from
subtracting a country’s average score for its nine-year-olds from that of its
fourteen-year-olds, dividing by the age difference, and multiplying by five (for the 
five years difference) (OECD, 1995, p. 207). The result is said to indicate how much 
students “gained” or “lost” in the intervening period. Students in the United States, 
according to this measure, gained 124.9 points, less than the average gain of 159.6 
points. From this Walberg concludes that United States schools don’t do a very good 
job in reading achievement with students after the fourth grade or so. 
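
The arithmetic behind the "reading efficiency" score described above can be sketched in a few lines. This is only an illustration under stated assumptions: the function name, its parameters, and the sample scores (500 and 620, with samples 4.8 years apart) are invented for the example and do not come from the OECD report. Only the gains of 124.9 and 159.6 points are taken from the text.

```python
def reading_efficiency(score_age9, score_age14, age_gap_years, span_years=5.0):
    """Gain per year between the two age groups, rescaled to a five-year span."""
    return (score_age14 - score_age9) / age_gap_years * span_years


# Hypothetical country (invented numbers): mean scores of 500 and 620,
# with the two samples 4.8 years apart in average age.
example_gain = reading_efficiency(500.0, 620.0, age_gap_years=4.8)

# Figures reported in the text: United States gain versus the average gain.
US_GAIN = 124.9
AVERAGE_GAIN = 159.6
shortfall = round(AVERAGE_GAIN - US_GAIN, 1)

print(f"hypothetical gain: {example_gain:.1f}")
print(f"U.S. shortfall relative to the average gain: {shortfall}")
```

Note that the score is simply a rescaled difference between two group means; it says nothing by itself about how high either group scored.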

But Bracey (1997b) notes that, by this same measure, the top-scoring country 
in the world, Finland, had the third worst “reading efficiency” among the countries 
compared, whereas the worst-scoring country in the OECD sample—Denmark— 
had the greatest efficiency, since its scores climbed the most between ages nine and 
fourteen. This should lead us to think that something may be amiss in the measure 
itself. In fact, Bracey points out that the relationship between a country’s ranking on 
the OECD’s efficiency scale for its nine-year-olds and its fourteen-year-olds was 
strongly negative (rank order correlation = -.69), suggesting that this rather odd 
measure of “progress” depends more on how poorly you start out rather than on true 
increases or declines in reading achievement. 
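
A rank-order correlation of the kind Bracey reports can be computed as in the following sketch. It assumes the Spearman formula for rankings without ties; the five "countries" and their ranks are invented purely for illustration and are not Bracey's data.

```python
def spearman_rho(ranks_x, ranks_y):
    """Spearman rank-order correlation for two untied rankings of equal length."""
    n = len(ranks_x)
    if n != len(ranks_y) or n < 2:
        raise ValueError("need two rankings of the same length (at least 2)")
    sum_sq_diff = sum((x - y) ** 2 for x, y in zip(ranks_x, ranks_y))
    return 1 - 6 * sum_sq_diff / (n * (n ** 2 - 1))


# Invented ranks for five hypothetical countries (1 = best).
rank_on_age9_score = [1, 2, 3, 4, 5]
rank_on_efficiency = [5, 3, 4, 1, 2]

rho = spearman_rho(rank_on_age9_score, rank_on_efficiency)
print(round(rho, 2))
```

A strongly negative value, as in this made-up case, means that the countries ranked highest on one list tend to be ranked lowest on the other.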

Another critic, Andrew Coulson (1996), agrees that reading achievement has
been mostly stable, but he still sees signs of decline among our high school seniors. 
He notes that comparable measures of adult literacy given in 1985 and 1992 (the 
Young Adult Literacy Survey and the National Adult Literacy Survey, respectively) to 
twenty-one- to twenty-five-year-olds showed “real” declines. A closer look at the data 
shows, in fact, rather small drops: Among whites, the decline was a mere 9 points on 
a 500 point scale, from 305 to 296. The largest drop was among Latinos, but the au¬ 
thors of the report note that there was a huge increase in the number of adults tak¬ 
ing the test for whom English was not their first language, which would depress test 
scores regardless of how well schools were performing in reading instruction. Once 
again, there is little basis for arguing that reading performance has recently declined. 
Neither Walberg’s nor Coulson’s evidence, when examined more closely, shows that
students today are performing any worse than did previous generations. 

Myth 6: The Number of Good Readers Has Been 
Declining, While the Number of Poor Readers
Has Been Increasing 

It is of course conceivable that, while the average performance of United States students
has remained the same, the number of students “at the top” has been declining,
and this indeed has been claimed by some critics (e.g., Murray & Herrnstein,
1992; Coulson, 1996). Considerable attention has been paid, for example, to the results
of the Scholastic Aptitude Test (SAT), and to slight declines in the numbers of students
scoring at the SAT’s highest levels (Hirsch, 1996). While it is true that the number
of students scoring above 700 on the SAT did decline, the numbers were never
high (2.3 percent in 1966, 1.2 percent in 1995). More importantly, the large demographic
changes in United States schools over the past three decades have almost certainly
had an influence on these small drops in test scores. As Bracey (1997a) points
out, the drops occurred primarily between 1966 and 1972, since which time the percentage
of students scoring above 700 has remained stable. Claims of a general decline
over many years are not warranted. Moreover, two studies that have attempted
to control for the significant demographic shifts in the test pool since the early 1950s
have found that the average declines during the 1960s and 1970s were rather small
(see Bracey, 1997a).

Table 1.5

Percentage of Students Scoring at Highest Performance Levels
for Age on NAEP, 1971-1994

AGE                       1971  1975  1980  1984  1988  1990  1992  1994
17-Year-Olds (Above 350)     7     6     5     6     5     7     7     7
13-Year-Olds (Above 300)    10    10    11    11    11    16    15    15
9-Year-Olds (Above 250)     17    16    19    18    19    20    17    18

(from Table 8.1, p. 130 in Campbell, Reese, et al., 1996)

But the most important point to keep in mind when discussing the SAT is that 
it is not a representative sample of United States high school students; it is not even 
a representative sample of college-bound seniors. It is a voluntary test that a large 
proportion of students take in some states (e.g., Connecticut, New York) and hardly 
any students take in other states (e.g., Iowa, North Dakota). 9 The NAEP tests, by con¬ 
trast, are representative, and they indicate no decline in the percentage of students 
scoring at the highest levels, as Table 1.5 illustrates. 

The performance levels given in Table 1.5 are generally the highest for which a 
given age group has scored since the NAEP tests began. 10 As can be seen, little change 
has occurred in the percentage of high-scoring students at any grade level, with the 
percentage of thirteen-year-olds scoring at the top levels showing an increase over 
the past three decades. 

The flip side of the supposed decline of the “best and brightest” is that the 
number of poor readers has been increasing. Once again, the results from the nation¬ 
ally representative samples on the NAEP test show this not to be the case, as indicated 
in Table 1.6. The number of poor readers has remained fairly stable among all age 
groups (using the lowest cutoff points given by the NAEP reports). 

Table 1.6

Percentage of Students Below Lowest Performance Levels by Age
on NAEP, 1971-1994

AGE                       1971  1975  1980  1984  1988  1990  1992  1994
17-Year-Olds (Below 200)     4     4     3     2     1     2     3     3
13-Year-Olds (Below 200)     7     7     5     6     5     6     7     8
9-Year-Olds (Below 150)      9     7     5     8     7    10     8     8

(from Table 8.1, p. 130 in Campbell, Reese, et al., 1996). Scale is 0 to 500.
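
One simple way to check the claim of stability is to compute, for each age group, the net change between the first and last assessments and the spread between the highest and lowest years. The sketch below does this for the percentages in Tables 1.5 and 1.6; the variable and function names are my own, and the two summary measures are only one possible, rough way of describing "little change."

```python
YEARS = [1971, 1975, 1980, 1984, 1988, 1990, 1992, 1994]

# Percentage scoring at the highest performance level (Table 1.5).
TOP = {
    "17-year-olds": [7, 6, 5, 6, 5, 7, 7, 7],
    "13-year-olds": [10, 10, 11, 11, 11, 16, 15, 15],
    "9-year-olds": [17, 16, 19, 18, 19, 20, 17, 18],
}

# Percentage scoring below the lowest performance level (Table 1.6).
BOTTOM = {
    "17-year-olds": [4, 4, 3, 2, 1, 2, 3, 3],
    "13-year-olds": [7, 7, 5, 6, 5, 6, 7, 8],
    "9-year-olds": [9, 7, 5, 8, 7, 10, 8, 8],
}


def net_change(series):
    """Last assessment minus first assessment, in percentage points."""
    return series[-1] - series[0]


def spread(series):
    """Difference between the highest and lowest year, in percentage points."""
    return max(series) - min(series)


for label, table in (("top", TOP), ("bottom", BOTTOM)):
    for age, series in table.items():
        print(f"{label:6} {age:12} {YEARS[0]}-{YEARS[-1]}: "
              f"net {net_change(series):+d}, spread {spread(series)}")
```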


Myth 7: California’s Test Scores Declined 
Dramatically Due to Whole Language Instruction 

Some critics have not been satisfied with merely finding a crisis (or illness) where 
none exists. It has also become necessary to produce a guilty party to blame for our 
greatly exaggerated woes (Levine, 1996; Stewart, 1996). In this case, the villain is an 
approach to the teaching of reading called “whole language.” The focus of these attacks
has centered primarily on California, a state that at least nominally adopted a
more “holistic” view of teaching language arts back in 1987 that, it is claimed, led to 
a steep decline in reading scores. California has become a national test case in read¬ 
ing instruction, and since many of the skills-oriented solutions to our current read¬ 
ing “crisis” are in large part a reaction to what happened there, it is worth examining 
the situation in some detail. 

Two points are at issue in the case of California and its reading crisis. First, did 
California’s reading test scores really “plummet” (Stewart, 1996, p. 23) to record lows 
after 1987? Second, was this sharp decline attributable to the adoption of a reading 
curriculum in the state in 1987 (CRTFR, 1995) that emphasized reading books and 
decreasing (but not eliminating) phonics and skills instruction? It turns out that the 
answer to both of those questions is “no.”

California Dreamin’: The Surf Was Never Up

The popular wisdom about California’s decline stemmed mostly from the release of 
two sets of test scores: the 1992 and 1994 NAEP scores, and results of the state’s own 
California Learning Assessment System (CLAS). In both the 1992 and 1994 state 
NAEP rankings, California fared rather poorly: Of the 41 participating states and territories
in 1992, California ranked 38th; in 1994, it was tied for last (Campbell,
Donahue, Reese, & Phillips, 1996). Rank scores are, of course, slightly misleading,
since a difference of a few points can cause a state to rise or drop in rank if the states 
are clustered closely together. But even when this clustering is taken into account,
California still did not fare well: In 1992, the state was in the bottom third, and in 
1994, in the bottom quarter (Campbell, Donahue et al., 1996). Clearly, California was 
performing relatively poorly compared to the rest of the nation. 

But performing poorly is not the same as getting worse. To show a decline, one 
must look at scores from both the beginning and the end of the time period in ques¬ 
tion. Herein lies the problem: State-level NAEP scores are unavailable before 1992, 
and the tests are not equivalent to any other standardized reading measure. As such, 
the NAEP data cannot tell us anything about whether scores went up or down after 
the implementation of the literature-based curriculum. 

The second set of data cited to show California’s supposed drop, the Califor¬ 
nia Learning Assessment System (CLAS), suffers from the same lack of comparabil¬ 
ity. The CLAS tests were administered only twice, in the 1992-93 and 1993-94 school 
years. Like the NAEP scores, they are not comparable to previous test scores and tell
us nothing about how California students read before and after the 1987 Framework 
adoption. (In fact, the two years are not even comparable to each other, due to 
changes made in the test after the first year of its administration.) Ironically, the 
1993-94 results indicated that more than three-fourths of California’s students could 
read at a “basic” level (McQuillan, 1996, in press: d). 

The only test score data available both before and after the implementation of 
the “holistic” 1987 Language Arts Framework are the California Achievement Pro¬ 
gram scores. As can be seen in Table 1.7, there is no indication of dramatic drops or 
increases. Some scores went up, some went down, but for the most part things were 
remarkably stable. California’s third graders, for example, were performing at the 
same level in 1990 as they were in 1985, which would be unlikely if the 1987 Framework
resulted in a dramatic decline in test scores. 11

Despite repeated claims to the contrary, then, there is no evidence that California
reading scores have declined markedly over the past ten years, and sufficient
data to suggest that there has been little change in achievement since the 1987 Framework
adoption. This stability of test scores is remarkable in light of the declining financial
status of California’s schools, a decline that has been largely ignored in
the debates over reading achievement in the state. Indeed, the teachers of California
are to be commended for managing to keep scores from plunging given the limited
resources they have been allotted (see Chapter 7).

Table 1.7

California Achievement Program Raw Scores for 3rd, 6th, 8th,
and 12th Grades, 1984-1990

GRADE     1984  1985  1986  1987  1988  1989  1990
3rd        268   274   280   282   282   277   275
6th        249   253   260   260   265   262   261
8th        250   240   243   247   252   256   257
12th       236   241   240   246   250   248   251

(from Guthrie et al., 1988, 1993)

Whole Language to Blame? 

The second part of the argument used to promote a renewed emphasis on skills in¬ 
struction was that whole language was the cause of California’s (nonexistent) decline 
and (very real) low national ranking. As noted earlier, it is true that California ranked 
relatively low among the states. Is a literature-based curriculum or whole language 
to blame? Another look at the 1992 NAEP data reveals that the answer appears to be 
“no.” As part of the assessment, fourth-grade teachers were asked to indicate their 
methodological approach to reading as “whole language,” “literature-based,” and/or 
“phonics.” The average scores for each type of approach were then compared, and 
those children in classrooms with heavy emphasis on phonics clearly did the worst. 
Children in whole language-emphasis classrooms (reported by 40 percent of the 
teachers) had an average score of 220, those in literature-based classrooms had a 
score of 221 (reported by 49 percent of the teachers), and students in phonics class¬ 
rooms (reported by 11 percent of the teachers) came in last with an average score of 
208 (NCES, 1994, p. 284). Similar patterns were observed in the 1994 NAEP results 
(Campbell, Donahue et al., 1996). If these data on teacher self-report are accurate,
and reading method is a primary determinant of reading scores in a state, there is 
little reason to blame whole language for California’s reading woes. 12 


* * * 


Many things are wrong with United States schools, and, as I stated at the be¬ 
ginning of this chapter, I have no interest in ignoring legitimate problems we face in 
the area of literacy development. But false crises and distorted views of student 
achievement can only distract us from the real concerns of parents, teachers, and 
policymakers. Instead, we need to have some understanding of what reading is and 
know some of the most important factors influencing reading achievement. In the 
following chapters, I examine some basic theories of language and literacy acquisi¬ 
tion, a few competing explanations of the critical variables related to reading achieve¬ 
ment, and what I believe to be the real barriers to improving literacy in our 
low-achieving schools. 



2 


Getting Started: 

What Do You Need to Learn 
How to Read? 


The subject of how children learn how to read has attracted an extraordinary
amount of attention, both from the popular media and in the scientific literature.
Indeed, the intense focus on early reading has taken on the overtones
of an all-out war in which opposing sides are thought to do battle. The purpose of 
this chapter is to attempt to lay out a general framework for understanding how chil¬ 
dren learn to read, setting out in an admittedly simple but, I hope, comprehensible 
manner some of the principal factors that need to be taken into account in explain¬ 
ing why some children succeed where others fail. Once that is done, it will be easier 
to understand where the true controversies lie in the debates over reading, and which 
evidence is appropriate in judging competing prescriptions about what schools 
should do to help struggling readers. 


A General Model of Language Acquisition 

Stephen Krashen (1981, 1996b) has proposed that we acquire languages when we
engage in a single, all-important act: when we understand messages. More precisely,
Krashen states that acquisition results from the comprehension of messages that contain
elements of language slightly above our current level of competence. This general
relationship has been supported by evidence in both first (L1) and second (L2)
language acquisition for both oral (Asher, 1994; Cross, 1977; Hart & Risley, 1995;
Krashen, 1981, 1985; Swaffer & Woodruff, 1978) as well as written language (Anderson,
Wilson, & Fielding, 1988; Elley & Mangubhai, 1983; Elley, 1991; Krashen, 1993;
Taylor, Frye, & Maruyama, 1990). 1

Figure 2.1 is a representation of this general model of language acquisition,
where the act of comprehending messages results in language acquisition. The notion
that we acquire language by understanding messages is also central to Ken and Yetta
Goodman’s (Goodman, 1967; Goodman & Goodman, 1979) and Frank Smith’s
(1973, 1994) theories of literacy acquisition. I will refer to this idea that reading
development is a result of understanding meaningful written language as the
Goodman-Smith Model, since these researchers have been the most instrumental in
providing the intellectual and empirical basis for it. 2

Figure 2.1

A General Model of Language Acquisition

How Do We Understand Messages?

It is useful to consider just what takes place when we try to make sense of language, 
specifically written language, since our focus here is on reading. As it turns out, read¬ 
ing involves using both the information that is present on the written page, as well 
as the information we already have in our minds. Smith (1994) notes that reading is, 
like all comprehension, relating the known to the unknown. We do not begin our 
encounter with written language (sometimes called simply print or text) with a 
tabula rasa, or blank slate; we are able to reduce the uncertainty of what is on the 
page by what we already know. As an experienced reader picking up a book in En¬ 
glish, for example, you know where to look for the beginning, which side is up, where 
the page numbers are likely to be located, and so forth, based on your previous ex¬ 
periences with text. Similarly, when you look at words on a page, you know that only 
so many likely alternatives exist for the way letters combine to make words, words 
combine to make sentences, and, on a larger scale, sentences combine to convey the 
particular meaning of a text. Of course, we are sometimes wrong, sometimes sur¬ 
prised, and on occasion we read more slowly or reread to understand the (unlikely) 
alternatives we encounter. It takes longer to choose among more alternatives, so that 
the more unlikely the concept or word or even letter, the more time we need to de¬ 
cide which option “fits” or makes sense. This highlights the fact that comprehension 
is a normal activity where complete uncertainty is relatively rare; reading is some¬ 
thing we expect to make sense. 

The information we use to reduce uncertainty—what Smith refers to inter¬ 
changeably as “nonvisual” information, background knowledge, or “context”—in¬ 
cludes all we know about language: how markings on the page form letters and 
words, the meaning of words, how sentences go together, the topic of what is read, 
the sounds letters make, and many other things. Some of this background knowledge 
will be more useful than others, and some (such as the sounds letters make) may only
be used as a last resort, when all other ways of making sense have broken down. 

Ironically, as Smith (1994) points out, we use the actual visual information, the 
markings on the page, as little as possible to make sense of what we read. To attempt 
to focus on each letter or each word would overload our capacity to process information,
like the slow reader who, by the time he or she has reached the end of a sentence,
has forgotten the first word. Luckily, our background information is so
powerful and able to reduce uncertainties at every level of reading—word, sentence, 
and beyond—that fluent reading can take place despite the enormous amount of vi¬ 
sual information present in the text. 

Reading is making sense of text, then, at the highest level possible: We “read” 
not individual letters or even words, but meaning as our eyes scan the page. When 
we come across an alternative that surprises or confuses us, we slow down and, per¬ 
haps, focus on a smaller group of words. If that doesn’t help, we may look at each 
word to try to understand what is taking place. If the word itself is not familiar to us, 
we may resort to other strategies to figure out its meaning, including trying to 
“recode” it into sound by groups of letters or even, as a final resort, by individual let¬ 
ters. Notice, however, that this chain of events takes place only if comprehension 
breaks down, only if, as we are going along extracting meaning from the larger units 
of visual information (sentences, phrases), we run into trouble, confused by some 
unlikely alternatives to what we think should come next. 

Beginning readers, of course, have less knowledge of how the printed page is 
constructed, and less nonvisual information to reduce the number of alternatives they 
often face when reading. When comprehension breaks down, they may more often 
resort to ways of attempting to get the meaning of a particular word, including the 
possibility of using the sounds of the word’s letters. This does not mean, however, that 
sound is essential to reading, or that fluent reading necessarily involves such a letter- 
by-letter, sound-by-sound “decoding.” The use of sound in reading is one strategy for 
trying to make sense of the written text, but by no means the only or primary one. 


Access, Assistance, and Acquisition 

Knowing how we go about comprehending written language does not give us a very 
complete picture, however, of why some people read well and others not so well or 
not at all. To help answer these questions we need to expand our model outlined 
above by integrating into it two additional factors: the access we have to written lan¬ 
guage “input” (that is, reading materials) and the types of external assistance we may 
receive in helping us understand these written messages. Variations in the amount 
of reading materials available for us to comprehend will in part determine over time 
how much we read, and as a consequence, how well we read. Likewise, the types of 
outside help we get in understanding written messages from sources aside from the 
text itself—such as from teachers, parents, and the like—may facilitate (or hinder) 
how well we can make sense of what we read. 

Access 

What is often ignored in discussions of language and literacy acquisition, but logically 
implied in our Expanded Model of reading acquisition (refer to Figure 2.2 on page 20), 
is a source of written language input, or more simply, reading materials. We know from 
studies of second-language (L2) acquirers, for example, that access to language input
as measured by how long a person lives in a country is positively related to the person’s 
level of language acquisition, assuming that the messages encountered are comprehen¬ 
sible (i.e., the person is at least at an intermediate level of proficiency) and that the per¬ 
son has interaction with their surroundings (Krashen, 1976). Similarly, evidence exists 
that access to written input—reading materials—is also associated with more literacy 
development (Elley, 1992; Krashen, 1985, 1989, 1995). Although clearly one must actually
“receive” the input by listening or reading in order to acquire language, large
variations in the supply of input indirectly affect the level of language acquisition, and 
may explain large individual differences in proficiency. 3 One of the central arguments of this
book is that access to reading materials (written language input) is perhaps as
important as, if not more important than, the other factors proposed to explain reading difficulties.

Two Types of Assistance 

Once we have access to written language, how are we able to understand it? As noted 
above, our previous experiences with text will give us a great deal of useful knowledge 
in making sense of print. But we can also receive assistance from other, more capable 
readers, especially during our initial attempts at reading. This outside help given to 
us as children learning how to read is often simply called “instruction,” but in order 
to understand more precisely how this assistance plays a role in reading, it is useful to 
classify it into two types: “metalinguistic” assistance and “elaborative” assistance. The 
claim here is that both types of “instruction” or assistance may be helpful in compre¬ 
hending written language, even though neither is, strictly speaking, required. 

Metalinguistic Assistance 

The term metalinguistic means literally “beyond language,” and is used here to mean 
knowledge about the way in which the language system itself works, a knowledge that 
we have explicitly and consciously (in Krashen’s terms, “learning” [1981]).
Metalinguistic assistance is thus the help we get from more capable readers about 
how written language is put together. This includes things such as being provided 
information about how sounds and letters correspond, doing exercises aimed at im¬ 
proving one’s awareness of how speech can be broken down into smaller units of 
sound, and being taught rules of grammar. Such explicit knowledge may be useful 
to us at times in understanding written input, especially when our background 
knowledge and the context of the printed page are insufficient to help us make sense 
of the message being conveyed. 

Smith (1994) gives a good example of how one type of conscious, metalinguistic 
knowledge can be of help in comprehending text. Suppose a child is confronted with 
the following sentence but is unable to read the last word: 

The man came riding in on his horse. 

Knowledge that the h at the beginning of a word makes a certain sound would reduce
the number of possibilities that the reader had to choose from. She would know, for
example, the word could not be car or train. Explicitly teaching a child the sounds
that letters make could contribute somewhat to comprehension of text and thus indirectly
to reading acquisition.
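
The way a single letter-sound cue narrows the field of alternatives can be pictured with a small sketch. It is a deliberate simplification: matching the written first letter stands in for knowing its sound, the function name is invented, and the candidate list contains only the three words mentioned in the example above.

```python
def narrow_by_first_letter(candidates, first_letter):
    """Keep only the candidate words that begin with the given letter."""
    return [word for word in candidates
            if word.lower().startswith(first_letter.lower())]


# Words a reader might expect after "The man came riding in on his ...".
candidates = ["horse", "car", "train"]

remaining = narrow_by_first_letter(candidates, "h")
print(remaining)
```

The cue does not identify the word by itself; it only removes alternatives that context had left open.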

Giving metalinguistic assistance is what I will refer to as “formal instruction.”
For beginning literacy, this typically takes place in formal settings like school when 
children are taught the alphabet and how sounds and letters go together (“phonics”), 
although parents sometimes also give their children limited metalinguistic knowledge 
at home by teaching them letter shapes and sounds (Durkin, 1966; Yaden, Smolkin, 
& Conlon, 1989). Note, however, that one can possess metalinguistic knowledge without
being taught it; such knowledge may, that is, be discovered on one’s own.
Moustafa (1995, 1997) points out that children may come to understand how sounds
and letters correspond through inductive use of analogy, comparing the parts of 
known words with unknown words. For example, a child who can read green and back 
can figure out how to pronounce nonsense words like grack by this very strategy. 
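
The analogy strategy in this example can be sketched as follows. The sketch assumes one particular way of dividing a word into parts (everything before the first vowel letter, and everything from that vowel onward); this split and the function names are my own illustrative choices, not a description of the procedure used in Moustafa's studies.

```python
VOWELS = set("aeiou")


def split_at_first_vowel(word):
    """Return (letters before the first vowel, letters from the first vowel on)."""
    for index, letter in enumerate(word):
        if letter in VOWELS:
            return word[:index], word[index:]
    return word, ""  # no vowel letter found


def blend_by_analogy(known_word_for_start, known_word_for_end):
    """Combine the start of one known word with the end of another."""
    start, _ = split_at_first_vowel(known_word_for_start)
    _, end = split_at_first_vowel(known_word_for_end)
    return start + end


new_word = blend_by_analogy("green", "back")
print(new_word)
```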

One of the long-standing controversies in reading instruction is just how 
much, if any, formal, metalinguistic instruction children need to learn how to read. 
Some have claimed that it is absolutely essential in reading acquisition (e.g., Share, 
1995), and that the presence or lack of such instruction can explain why some chil¬ 
dren ultimately succeed or fail in reading. Others have taken the position advocated 
here, that metalinguistic assistance is but one type of help (“instruction”) children 
can receive, helpful to some limited extent for some children when desired or re¬ 
quested, usually in the context of trying to make sense of text. 

Elaborative Assistance 

By “elaborative assistance” I mean all other types of help given readers to make sense 
of language other than that related to how the language system itself is constructed; 
that is, information other than metalinguistic instruction. In the case of a child ac¬ 
quiring her first language, elaborative assistance may consist of things such as “se¬ 
mantic contingency” (continuing on about the topic introduced by the child’s 
previous statement), a slower rate of speech, restatement of the message, using “here-and-now”
topics, and similar devices (Newport, Gleitman, & Gleitman, 1977; Snow,
1983). All of these actions or strategies help children understand the messages being 
directed at them by giving them additional clues as to the meaning of the input they 
are exposed to. Similarly, second-language teachers use elaborative methods such as 
rephrasing or restating their messages, slowing their rates of speech, using props or 
gestures to give clues to their meaning, all in an attempt to make the language more 
comprehensible (Freed, 1980; Krashen, 1981). The focus is not on the structure of
the language system per se, but on understanding the meaning of the message.

In the acquisition of written language, elaborative assistance may be provided 
through verbal explanations of the story, giving the child an oral explanation of the 
background or the meaning of words. It may include reference to illustrations of the 
text, and how they help convey the message of the printed page. Studies of storybook 
reading, for example, show that parents give precisely this type of elaborative assistance
about the story by describing the pictures, explaining the meaning of words,
and giving background information on topics that appear in the story (Snow &
Ninio, 1986; Smolkin, Conlon, & Yaden, 1988; Yaden, 1988). The focus of elaborative
assistance is on making the meaning of the written language comprehensible,
rather than consciously describing the language system itself. 

Reformulation of the Model 

Figure 2.2 restates our hypothesis for reading acquisition with the clarifications 
noted above, adapting that proposed by Krashen (1993). The model predicts that an 
increase in access to reading materials will tend to lead to an increase in the amount 
of reading, which in turn leads to higher levels of reading acquisition. This reception 
can be facilitated by both elaborative and metalinguistic external assistance, although 
neither is necessary. 4 


Counter Arguments on Reading 

Several researchers have advocated positions contrary to the model outlined above. 
It has been claimed, for instance, that metalinguistic assistance plays not just a help¬ 
ful role in learning how to read, but a critical one, and that certain forms of explicit 
instruction must be provided to nearly all children. As a consequence, some often 
expensive and sweeping pedagogical proposals to improve reading achievement have 
been proposed that include, among other things, a strong emphasis on “training” 
children in certain metalinguistic “skills” that are thought essential to reading. Un¬ 
derstanding the factors that influence reading acquisition is thus important not 
merely from a theoretical perspective, but from a practical one as well. The reason is 
simple: The resources available to schools to improve literacy achievement are lim¬ 
ited. Money devoted to programs to provide, for example, more intensive skills in¬ 
struction is money that will not be available for other things, such as books or 
libraries or helping teachers. It is therefore necessary to discuss the objections and 
opposing arguments to the Goodman-Smith view of reading, objections that we will 
group under four major headings (Stanovich, 1993): 

1. The Role of Phonology and Metalinguistic Instruction in Reading: It is claimed that
explicit, metalinguistic instruction is essential for children to learn how to
read, and by extension that lack of such instruction can explain variations in
reading achievement. This instruction must include (at minimum) training in
how sounds correspond to letters, and how speech can be broken down into
smaller units of sound (“phonemic awareness”).

2. The Use of Context: It is claimed that good readers actually make less use of “con¬ 
text” than poor ones while reading. This finding is said to disprove the 
Goodman-Smith view that reading is a process involving the use of background 
knowledge to reduce the number of unlikely alternatives in making sense of text. 

3. Eye Movement Research: Recent studies on the manner in which the eyes fixate 
on words in a text are thought to show that context does not facilitate reading, 
again contrary to what Goodman and Smith are said to have proposed. 

4. Method Comparisons: Studies comparing methods of teaching reading are said to show
that methods emphasizing metalinguistic assistance, such as training students on
sound-letter correspondences (“phonics”), produce superior results to “meaning-emphasis”
teaching methods such as “whole language.” This is taken to indicate that resources are best
spent focusing instruction on such metalinguistic training to improve reading
proficiency.

The next three chapters are devoted to addressing the rather complex set of issues
raised in the first objection, that sound and metalinguistic awareness are crucial
to reading acquisition and fluent reading. Chapter 3 will review studies on the
necessity of metalinguistic assistance in order to learn how to read. Chapter 4 will
discuss more indirect but still very suggestive evidence on the types of assistance readers
benefit from by examining cases of “early” readers. Chapter 5 focuses on the claims
made concerning phonological awareness and reading. The last three objections are
all covered in Chapter 6, which will discuss research on context, eye movements, and
method comparison studies. Finally, in Chapter 7, I argue that efforts to improve
reading achievement need first to be focused on issues of access to reading materials, 
without which no instructional program can succeed. 



3 


What Does It Take 
to Get the Alphabetic Principle? 


One of the central theoretical and practical issues in reading acquisition is the
role of formal instruction as we’ve defined it here: explicit, metalinguistic
assistance. Is it possible, for example, that children can learn to read without
such instruction? Is consciously learning about the written language system
absolutely essential for all children learning how to read, or is its contribution more
modest, one of being perhaps helpful to some children but not the key variable in 
initial reading acquisition? Our position stated in the previous chapter is that while 
some children can be assisted by a small amount of explicit teaching under certain 
circumstances, they benefit primarily from being exposed to print in a meaningful 
environment with appropriate forms of elaborative assistance. 

Central to the dispute over what role metalinguistic instruction has in reading 
acquisition (and subsequently the amount of resources that should be devoted to it)
is the proposition that, at least in English, children must first grasp the “alphabetic
principle” in learning how to read. The alphabetic principle has been defined by
Share (1995) as knowledge of both how sounds connect to letters (sound-letter
correspondence) and “phonemic awareness,” the conscious knowledge that speech can
be segmented or divided somewhat artificially into minimal units of sound
(“phonemes”). Phonemic awareness is the ability to divide, say, the word “cat” into three
separate phonemes, “cuh,” “ah,” and “tuh,” or in proper linguistics notation, /k/, /a/, 
and /t/. A grasp of the alphabetic principle is thought to give us the ability to “recode” 
or “sound out” written words, converting letters into sounds. 

Two claims, then, have been made concerning this relationship: (1) the alphabetic
principle (sound-letter correspondence and phonemic awareness) is necessary
for a child to learn how to read; and (2) children must typically be taught the two 
above-mentioned components explicitly (Adams, 1990; Ehri & Wilce, 1985; Share, 
1995; Stanovich, 1993). Share, for example, has stated that preliterate children who 
are exposed to print have “not been found to spontaneously induce discovery of the 
alphabetic principle,” and that therefore “explicit instruction” is necessary to “bring 
the decoding possibilities of an alphabetic orthography to a child’s attention” (1995,
p. 160).

The first claim has been disputed by a number of other researchers. Smith 
(1973, 1994) and Goodman (1996), for example, have argued that the ability to 
recode words into sound is not a prerequisite to reading, although it remains one 
resource at a child’s disposal in her attempts to make sense of text. (We will discuss 
the role of phonology in reading more thoroughly in Chapter 5.) 1 This chapter will 
focus instead on the claim that children must be explicitly taught the alphabetic
principle in order to acquire it. I will argue first that the studies cited by Share (1995) and
others, while consistent with a strong version of what we may call the “Instruction
Hypothesis,” are insufficient to support the claim that explicit instruction is
necessary (rather than merely helpful); and second, that it is equally plausible that the
alphabetic principle, and more importantly, the ability to read, come primarily from
exposure to comprehensible text with appropriate elaborative assistance, even
without any significant amount of explicit, metalinguistic instruction.

Two Views on the Naturalness of Reading 

Goodman and Goodman (1979) have posited that the acquisition of reading shares 
fundamental similarities with oral language acquisition: “Children learn to read and 
write in the same way and for the same reason that they learn to speak and listen. 
That way is to encounter language in use as a vehicle for communicating meaning” 
(p. 138). Drawing upon Halliday’s (1969) categorization of the functions of oral
language, Goodman and Goodman state that written language is acquired for similar
uses and along a similar developmental path. The focus of both the beginning and
proficient reader, Goodman and Goodman contend, is always on meaning, and a
“high level of conscious awareness of the [linguistic] units and system” is not
necessary to their task. Such metalinguistic knowledge of text (e.g., letter-sound
correspondences) is acquired largely as a consequence of that concentration on meaning.
In other words, explicit metalinguistic knowledge is not needed before one begins 
reading, but is largely developed while reading. Smith (1983) makes a similar point 
by stating that learning about the particular properties of written language comes 
most often from “hearing written language read aloud” (p. 47) and not primarily 
from a program of metalinguistic instruction. 

Several theorists have criticized this position and have instead proposed that 
initial reading acquisition consists of movement through a series of stages that at 
some point requires explicit, metalinguistic assistance. Gough and Hillinger’s (1980) 
two-stage model is probably the most straightforward of these proposals and
illustrates the basic principle of the others (Ehri, 1992; cf. Stuart & Coltheart, 1986).
Their first stage is “logographic” or “paired associate” learning of words, where
children use distinct visual features of the word itself or the context associated with text
as “cues” to remember the word. Hence the child might focus on the “hook” of the J
in John to identify this word, or the double, stacked circles of the letter B to
remember Bob. Gough and Hillinger argue, however, that this first-stage strategy is limited
and unproductive; eventually the child runs out of distinctive features with which to 
“memorize” in this fashion. They hold instead that readers must at some juncture
make reference to the sound a word would make in order to store it in memory, since 
“it defies the imagination to suppose that the child could learn 50,000 items [an 
adult’s reading vocabulary] by arbitrary associations” (p. 186). 2 

The second stage is “cryptoanalysis,” or what others have called “alphabetic” 
reading (Ehri, 1992). Here children learn that letters or groups of letters represent 
sounds, and that this correspondence allows one to “recode” written words into 
sound, which (if in the child’s oral vocabulary) he can then identify. In other words, 
this stage is marked by the acquisition of the alphabetic principle. Ehri (1992) breaks 
this stage up into three intermediate stages (partial, full, and consolidated
alphabetic), but the end result is the same: Children are able to link letters or groups of
letters to the pronunciation of words. Eventually, children are able to identify most 
words automatically without this constant recoding because “the brain processes the 
word’s pronunciation and meaning without any effort directed at decoding the 
word” (Gaskins, Ehri, Cress, O’Hara, & Donnelly, 1996, p. 317). Gough and Hillinger 
argue that the movement from logographic to alphabetic word identification cannot 
be accomplished by children on their own—it is a qualitative “jump” that requires 
explicit, metalinguistic assistance. 

Share (1995) advances a slightly different view of how children learn to read, 
which he terms an “item-based” rather than “stage-based” acquisition of recoding 
ability. Rather than posit a primarily logographic stage followed by an alphabetic 
one, Share holds that children use phonological information to identify words at 
various stages depending on their frequency and familiarity, and that only less
familiar words that appear relatively infrequently (“low-frequency words”) require
recoding. Even so, the initial ability to recode must normally be taught, according to 
Share, so in practice his position is similar to Ehri’s and Gough’s. 


Problematic Evidence on the Acquisition 
of the Alphabetic Principle 

Two types of studies have been cited to support the position that formal, 
metalinguistic instruction is necessary for acquiring the alphabetic principle. The first
type consists of examinations of the effects of “environmental print” exposure; that is,
written language that commonly appears in the environment of children, usually
associated with some symbol or logo, such as “McDonald’s” alongside a pair of golden
arches. The second type of evidence consists of “training” studies of children who are not yet
able to read independently, where they are exposed to text and then tested on their 
ability to perform tasks related to sound-letter correspondence or “decoding.” Each 
type of evidence suffers from its own particular shortcomings that make the available 
data ambiguous at best. The environmental print studies fail to take into account the 
types of contextual support available to children in their encounters with print. The 
training studies underestimate the duration and amount of input that may be needed 
for acquisition to take place. Let us take a look at examples of each type of evidence. 

Environmental Print Studies:
The Problems of Overspecification

The distinction between metalinguistic and elaborative assistance noted in Chapter 
2 dealt with help given by another, more proficient reader, such as a parent, teacher, 
or older sibling. Clearly this is not the only, or even most common, type of
“elaborative” help available to children learning to read. Text is often found in environments
that contain what is referred to as “extralinguistic” information, or information in 
addition to the language itself. This includes pictures, illustrations, logos, or symbols 
that give clues to the meaning of the text and render assistance from a more capable 
reader unnecessary. As Krashen has pointed out (personal communication), these 
clues can sometimes provide so much information that reference to the actual text is 
unnecessary for comprehension. The result is text that is overspecified. An illustrated
book in which the entire story can be understood perfectly from the pictures alone makes
the text redundant and is an example of an overspecified text. Conversely, the text could have too few
extralinguistic clues for a beginning reader to comprehend the message, making it 
underspecified. The ideal situation is when elaborative clues are somewhere between 
these two extremes. When text is properly specified, the background information the 
reader brings to bear in comprehending the text helps her reduce the number of
possible alternatives in meaning, but does not eliminate them. Some attention to the text
itself must occur to understand the message. 3 

Acquisition under properly specified conditions may be incremental in nature, 
similar to that described by Nagy, Herman, and Anderson (1985) in their
examination of vocabulary acquisition. Nagy and his colleagues found that unknown words
can be acquired gradually by a reader through several exposures to them in
meaningful contexts, with the reader picking up a bit more of the meaning with each pass.
Under such conditions, exposure to text may lead to incremental increases too small
to be detected by all-or-nothing measures such as simple pass/fail tests (i.e.,
know/don’t know the word).

Discussions of whether being exposed to meaningful print alone can lead to 
certain aspects of reading acquisition must take these distinctions into account. We
would not expect, for example, that children exposed to “environmental print” that
contained text embedded in a lot of extralinguistic clues would necessarily acquire any
insights into the linguistic elements of the text, since the context surrounding the
print in effect replaces the text in conveying the meaning. For example, seeing the 
golden arches helps a child identify McDonald’s without any attention paid to the 
actual name. 

This prediction is confirmed by the available research. Masonheimer, Drum, 
and Ehri (1984) studied one hundred three- to five-year-old children who could 
identify at least eight popular “environmental print” signs (e.g., Pepsi, Carl’s Jr.). The 
children were tested to see whether they could read those same words without the 
contextual support provided by the pictures or graphic information that normally 
accompanies them (e.g., no golden arches with the word McDonald’s). In addition,
these environmental print “experts” were also given tests to see whether they could
recognize some common words drawn from beginning or “primer” reading
materials printed in isolation on index cards. Of the one hundred children, only six
were able to read any of the primer words or read more than one of the eight
environmental print words without the accompanying contextual support. The
researchers conclude that reading words does not “evolve directly out of
environmental print experiences” (p. 265). A partial replication of Masonheimer et al. by Stahl
and Murray (1993) yielded essentially the same results. 

As stated earlier, this interpretation is correct: Children do not need to attend 
to the text when the extralinguistic clues convey all of the required meaning.
Exposure to this type of print is not sufficient to lead to the ability to read “bare” text in
and of itself, nor has that been claimed (Goodman, 1986; Harste, Burke, & Woodward,
1982). Children may need exposure to other types of text—properly specified—for 
the alphabetic principle to be acquired. 4 Exposure to environmental print may lead, 
however, to incremental increases in other types of print knowledge, as we shall see 
below. In any case, environmental print studies cannot be considered supporting
evidence that metalinguistic instruction is critical to reading acquisition.

Short Exposures and Impoverished Input 

Some variation exists among children in terms of the rate of acquisition of various 
linguistic elements in oral language, even though they appear to follow a general
developmental path (Brown, 1973) over an extended period of time. Similarly, we
should expect that acquisition of the aspects of written language will take place at
varying rates over a long period, and may require a considerable amount of exposure
to input. It appears untenable to posit, for example, that an infant exposed only to a
few hours of a language containing a given grammatical structure would extract all
of the necessary linguistic information contained in the input to acquire that
structure. Such a test is equally inappropriate in written language acquisition. Yet the
claim that explicit metalinguistic instruction is essential for learning how to read is 
based in part on studies that provide such limited input. The following three studies 
are among those cited by Share (1995) and illustrate the methodological flaws of this 
sort of evidence: 

Byrne (1992): Byrne conducted a series of studies to show that children need to 
be taught that speech can be broken down into smaller units—phonemes—and 
how these phonemes correspond to letters. In one experiment, for example, a 
group of eleven children were taught the words bat and fat in isolation. Most 
children were able to identify the two words after about ten minutes. This
training constituted the entire “exposure to print” for the experiment. The subjects
were then given several new words representing minimal pairs, distinguished
only by their first letters, either b or f (e.g., bell and fell, big and fig). On average,
children could not distinguish the new words above the level of chance, although 
individual scores were not provided. 

The amount of print the children were exposed to (two words) and the
ten-minute duration make Byrne’s conclusions very questionable. Byrne himself
acknowledged that two items may not be enough for children to acquire the
requisite phonemic representation. In a subsequent experiment, he increased the
number of words shown to the children to four, still clearly not sufficient to
test any hypothesis on the effects of print exposure.

Seymour and Elder (1986): Seymour and Elder’s study is cited by Share (1995) 
and others as demonstrating that even long-term exposure to text does not lead 
children to gaining insights into the alphabetic principle. The researchers looked 
at twenty-six children (ages 4.5 to 5.5) in a first grade classroom over a year’s 
time. The children were taught to read by the “Look-Say” method plus some 
“phonics-based” instruction in spelling. The teacher used a “whole-word” 
method, where words were “taught [120 words] by presentation on flash cards 
and in illustrated booklets for practice in identification and naming” over the 
course of one school year (p. 6). The emphasis on memorizing individual words 
can be seen in the way the researchers, one of whom spent about two days per 
week in the classroom, discussed “reading” with the children: “Do you
remember any [words] you learned this week?” and “Is this one of your reading words?”
(p. 19). The test of the children’s derivation of letter-sound correspondence was 
a series of isolated word lists given throughout the year, on which appeared both 
taught and new words. Very few of the children were able to pronounce any of 
the novel words. 

Despite a long period of exposure, Seymour and Elder provided the
children in this study with very impoverished input. It appears that little connected,
meaningful print was used in the instruction. Seymour and Elder demonstrated 
that focusing on individual word memorization via flash cards in a print-poor 
environment is not a good method of teaching reading. 

Ehri and Sweet (1991): Ehri and Sweet examined which factors were associated 
with success in “fingerpoint-reading” by children. Thirty pre-readers who had
neither received formal instruction in reading nor participated in storybook
reading in school were given pretests in letter knowledge, isolated word recognition,
and phonemic awareness, among other things. The treatment consisted of the
experimenters reading a story to the child several times, pointing to the words as
they did, followed by the child reading and pointing to the words. The storybook 
sessions took place over two days, with a total of forty to sixty minutes per child. 
Following the sessions, the child was asked to read isolated lines of text, isolated 
words in the text, and point to certain words in the text as the experimenter said 
them. Among the conclusions of the authors were that “even novice word
readers [those who could recognize at least one preprimer word prior to the
treatment] did not learn to read many individual words out of context as a result of
fingerpointing-reading practice” (p. 454). 

Even in this brief exposure, some gains were apparently made by the
children. Ten of the subjects (28 percent) were able to tell the difference between the
text they had read in the story and an altered text substituted in the same book, 
and five children (14 percent) who could not read any of the preprimer words 
on the pretest were able to read some of the words when presented with the 
complete text and exhibited the ability to segment blends into phonemes. This 
illustrates that they may have acquired some knowledge of print/letters even from
the short exposure, similar to the incremental acquisition noted earlier in the 
vocabulary acquisition studies by Nagy (e.g., Nagy, Herman, and Anderson, 
1985). 5 Other studies discussed by Share (1995) suffer from similar flaws of very 
brief exposures to a limited amount of comprehensible text (Carnine, 1977; 
Byrne & Fielding-Barnsley, 1989; Ehri & Wilce, 1987a, 1987b; Scott & Ehri,
1990). 

Acquisition of the Alphabetic Principle Without 
Metalinguistic Instruction: Suggestive Evidence 
on Elaborative Assistance 

The evidence put forth by Share and others to support the necessity of consciously 
teaching children about the metalinguistic properties of written language in early 
reading acquisition is insufficient or at best ambiguous. Other studies, while not 
without faults, suggest instead that the critical element in learning how to read,
including the acquisition of the alphabetic principle, is exposure to comprehensible
text. Reading acquisition appears to take place as long as text is provided in
appropriate, comprehensible contexts, and is of sufficient quantity and duration.

There are several studies of how children first come to make sense of written 
language, done by researchers in the beginning or “emergent” literacy field. These 
investigations have examined the effects of exposure to print over a long period of 
time among young children, usually before they begin attending school, and typically 
with more varied types of input than those used in the experimental training studies 
discussed above. While it is not, in most cases, possible to determine whether the
children in these studies have received systematic, formal metalinguistic instruction,
in most instances it seems unlikely, or at the very least it appears that the metalinguistic
assistance given was rather limited. Research on early readers (Durkin, 1966, 1974-1975),
for example, has found that some (but not all) parents teach children letter names 
and perhaps letter shapes, but this does not typically extend to explicit instruction 
in areas such as phonemic awareness. It is possible, of course, that children do gain 
such knowledge incidentally from these activities, but this falls far short of the sort
of instruction prescribed by those who believe in formal teaching of the alphabetic 
principle. 

One such study, McGee, Lomax, and Head (1988), examined the way in which 
young children read not only environmental print signs and labels, but also
“functional” print, defined as the types of text found in the home (e.g., a grocery list, a
telephone book, a coupon) not associated with specific logos, pictures, or symbols. This
latter category would more likely include print that is what we defined above as
“properly specified,” that is, print within a context that gives the reader some
background information to reduce the number of possible meanings, but that does not
completely eliminate uncertainty and thus requires some attention to the actual text.
McGee and her colleagues interviewed eighty-one children, ages three to five,
attending preschool and kindergarten classes. Using a detailed coding scheme, they
determined what children could read from both functional and environmental categories
of text, and what details they attended to in attempting to comprehend the text in
question. Children were classified by whether they could read eleven or more isolated
words from a list (“expert”), one to ten words (“novice”), or no words (“nonword”).
All the words were drawn from beginning-reader books. 

Little difference was found in the percentage of children among the three
categories who could read the environmental print example presented to the children,
confirming that this is probably the first type of print that children attend to, but not
one that leads immediately to independent reading. However, the expert readers
attempted to read the functional print items at a much higher rate (79 percent) than
the novice (51 percent) or nonword readers (28 percent). This is an indication that 
functional print may be the next step along a developmental path taken by some 
early readers. It may be, that is, that children encountering written language may 
move from very “contextualized” environmental print, with its visual information to 
help determine the meaning, to the less contextualized functional print, with its
familiar uses but smaller number of extralinguistic clues.

Even some of the nonword and novice readers who made attempts to read the 
functional items did so by attending to the text itself (the “graphic detail”), not just 
the surrounding symbols or drawings on the page. While McGee (1986) later
commented that the percentage of nonword readers who attended to the specifics of the
text (16 percent) was low, an alternative explanation is that there exists a gradual
developmental sequence in early readers as they acquire knowledge about text. All this
is consistent with our hypothesis that knowledge about print may grow slowly as
exposure to text increases, eventually leading to the ability to read independently. 6

Further evidence that children seem to grow in their knowledge of written
language comes from a study by Lomax and McGee (1987). Here the researchers tested
children ages three to seven on a variety of measures, including their “concepts about
print” (e.g., knowing which side of the book is up or that the print moves from left
to right), their awareness of the sounds letters make and how words can be broken
apart into smaller, “phonemic” units (such as the example of “cat” as “cuh,” “ah,” and
“tuh”), and their ability to read simple words presented in isolation. Lomax and
McGee’s results, partially summarized in Table 3.1, show that some awareness of
environmental and functional print as well as letter-sound correspondence exists even
among very young children. 

Environmental print awareness is one of the first stages of literacy acquisition 
for these children, with three-year-olds identifying nearly two-thirds of the
environmental print words when presented in context. Awareness of functional print items
is also present at an early age, and more importantly, continues to increase up
through the time children begin formal schooling. Substantial increases in
letter-sound correspondence knowledge and phonemic awareness occur between ages
three and four and four and five. The ability to read words in isolation, indicating 
acquisition of the alphabetic principle, is not seen at all until age five, however. 

One interpretation of these trends in the data shown in Table 3.1 is that
exposure to print leads to gradual acquisition of the two components of Share’s (1995)
definition of the alphabetic principle, phonemic awareness and letter-sound
correspondence, which in turn leads children to acquire the “insight” that can be used to
recode words in isolation. 7 Indeed, Lomax and McGee (1987) found through a
sophisticated statistical procedure (structural equation modeling analysis) that their
data were consistent with a developmental path.

Table 3.1
Growth in Print Knowledge Over Time

                        TOTAL
ITEM                    POSSIBLE  AGE 3          AGE 4          AGE 5          AGE 6

Environmental Print     20        73.5% (14.70)  87.2% (17.44)  93.8% (18.75)  97.8% (19.56)
(in context)
Functional Print        40        35.8% (14.30)  56.8% (22.70)  75.3% (30.10)  81.4% (32.56)
Phonemic Awareness      20        20% (4.00)     58.7% (11.74)  86.5% (17.30)  95% (19.00)
(initial sound)
Letter Sounds           21        21.2% (4.45)   53.8% (11.13)  87.5% (17.50)  97.2% (20.44)
Word List               100       .05% (.05)     .83% (.83)     27.2% (27.15)  75.4% (75.39)

(adapted from Lomax & McGee, 1987; Table 2, p. 246)


Conclusion 

Serious problems exist with the evidence cited to show that children cannot acquire 
the alphabetic principle without formal instruction. First, researchers have failed to 
take into account the fact that environmental print is generally overspecified.
Because children can derive the necessary meaning of many logos and signs without
attending to the text, reception of this input does not necessarily lead to any
acquisition of specific features of the writing system. Other forms of written input, or
environmental print that is more properly specified, would appear to be needed to make
the text more salient and comprehensible. 

Although it may not always lead directly to independent reading, environmental
print does serve a function in helping children learn to read. As Goodman (1986) and
others have noted, children begin to see that written language has a purpose and
meaning, even if they are not paying close attention to the “graphic” detail of the letters and
words embedded in it. The evidence is consistent with the hypothesis that acquisition 
is a process beginning with environmental and functional print, moving through stages 
of greater attention to graphic detail in an attempt to comprehend texts. This acquired 
knowledge of print and the way it functions appears to lead ultimately to the ability to 
read some words independently in context and, finally, in isolation. 

More research is needed on precisely what types of exposure help children
acquire initial literacy. Snow and her colleagues (Snow & Goldfield, 1982, 1983; Snow &
Ninio, 1986) conducted a series of studies on the discourse of “storybook” reading
between children and parents, and Yaden (Smolkin et al., 1988; Yaden et al., 1989; Yaden,
1988) has looked at the types of elaborative questions children ask during storybook
reading sessions. Both of these sets of studies give us some insight into the nature of
adults’ assistance to pre-readers. Little is known about precisely what types of elaborative
assistance or contextual help enable beginning readers to acquire certain aspects of text. Possible
exceptions to this gap in the research are Ehri’s studies on fingerpointing-reading (Ehri
and Sweet, 1991) and Samuels’ earlier work (Samuels, 1967) on illustrations and
isolated word learning, but neither of these is necessarily representative of normal print
exposure for early readers.

A second problem encountered in the evidence on acquiring the alphabetic
principle is the unrealistic estimation of the quantity and duration of input that
are likely to be needed to acquire the information contained in the input. The very
short exposures used in most training studies, some no more than a few minutes, and 
the extremely impoverished input (a few words) are not likely to lead to any easily 
measurable acquisition of linguistic information, and are completely inadequate to 
test whether children can acquire such information from exposure alone. Research 
from oral language (Pinker, 1994) and other aspects of written language acquisition 
(Nagy, Herman, & Anderson, 1985) suggests that many aspects of the language are
acquired over time, often over a period of years, not minutes. This of course places 
severe limitations on the type of evidence available to answer this question, since it 
is almost impossible to observe children carefully over long periods of time to see 
what sort of assistance they might receive when learning to read. 

It should be again emphasized that our alternative interpretation of the
available evidence, like Share’s (1995) version, is still very much open to debate. This is
due in part to the nature of the phenomenon being studied. The naturalistic inquiry
required to determine if print exposure can lead to the alphabetic insight by
necessity lacks the experimental control needed to give us a clean test of the hypothesis.
Strong claims that systematic, explicit metalinguistic instruction is necessary (versus
merely helpful) for children to learn how to read are not warranted by the data
currently available, however.

Children can learn to read at different ages, in different settings, and with different
forms of adult assistance. In this chapter, we will examine additional
evidence from a variety of sources on how learning to read is accomplished 
through appropriate exposure to comprehensible written language, with the help of 
both elaborative and, on occasion, some metalinguistic instruction. Much of this evi¬ 
dence comes from children learning to read outside the classroom and, as such, has 
its limitations. One cannot always be certain of the types of help children are given 
outside of school, yet the data we will discuss here is at least suggestive that elabora¬ 
tive assistance and access to print may be more important in assuring long-term 
reading success than other factors. I will begin by reviewing the research on “early
readers,” children who learn to read before arriving at school; next, I will analyze sev¬ 
eral studies on the relationship between the age children learn to read and their long¬ 
term reading proficiency; and finally, I will discuss some of the potential pitfalls of 
programs that focus on instruction versus access in helping struggling readers. 

“Natural” and Precocious Readers 

The issue of whether children need systematic metalinguistic training in order to 
learn how to read can be addressed in part through several case studies that have 
been published on “natural” readers. Goodman and Goodman (1982), for example,
reported that their daughter Kay learned to read on her own, and at age six-and-a-half
could read at a fifth-grade level on a standardized measure of reading. They report
that their daughter began reading “with no instruction” independently at the
age of five years, six months. Despite a lack of formal teaching, Kay’s reading did not 
come without what we have termed elaborative support from her parents. Her envi¬ 
ronment was “rich in language experience. She has always been read to, listened to, 
and talked to. Singing, poetry, nursery rhymes, and oral family language games are 
daily fare in her home” (p. 221). 
Torrey (1969) investigated the case of John, who learned to read without any
adult instruction at age five. John’s mother, who had a tenth-grade education, re¬ 
ported that no one had “read to or taught him” to read. He did, however, watch a 
great deal of television, where many children’s shows had objects labeled. Torrey 
notes that John was not pressured by his parents into reading. His exposure to suffi¬ 
ciently specified print—in games and on television—was apparently sufficient to al¬ 
low him to begin reading independently. Forester (1977) cites the cases of two girls, 
a five-year-old and four-year-old, both of whom learned to read without instruction. 
The five-year-old began reading after her grandmother gave her a common form of 
elaborative assistance: reading to her. The four-year-old began reading after listen¬ 
ing to tape recordings of children’s books. Forester notes that the children took “ob¬ 
vious pleasure” from their reading (p. 164). 

Case study and survey evidence on learning to read without formal instruction 
is also found in the research on “precocious” readers, usually defined as those chil¬ 
dren who are reading at a high level (at second- or third-grade level) upon entering 
first grade. The earliest reports of such readers date back to the first part of this cen¬ 
tury (reviewed in Coltheart, 1979; and Torrey, 1979). More recently, Henderson, 
Jackson, and Mukamal (1993) reported on the case of Max, a young boy who learned 
to read without formal instruction at the age of three. As in the case of John dis¬ 
cussed above, Max seemed to benefit from early television viewing in learning let¬ 
ters and words. His parents report that after watching the television game show 
Wheel of Fortune, Max appeared to know the names of many letters and could read 
certain labels on food packages at the age of one year, nine months. When Max was 
two, his mother began offering some metalinguistic assistance by using letter mag¬ 
nets to spell words out for him on the refrigerator related to kitchen activities. His 
mother also started to write short stories for Max about the day’s events and read 
these to him, in addition to reading regular storybooks to him twice a day. 

Survey studies confirm that a certain number of children learn to read with¬ 
out school instruction. Durkin (1966) conducted two studies in which she examined 
49 precocious readers in California and 30 in New York. In the California study, 
Durkin found that more than half of the children received only occasional instruc¬ 
tion from parents and/or siblings in learning to read, although all of the parents re¬ 
ported that they read to their children. Being read to was also the most important 
factor indicated by the children themselves as motivating them to learn how to read. 
Similar results were found in the New York study, where all of the parents of early 
readers reported reading to their children. My own research based on data collected 
from a more representative sample of United States parents has found that perhaps 
as many as one out of every ten children begins reading outside of school (McQuillan, 
in press: a), suggesting the phenomenon may not be as rare as previously thought.

Jackson, Donaldson, and Mills (1991, cited in Jackson & Lu, 1992) studied 116
children who were reading at a third-grade level or higher upon entering first grade,
including some children whose first language was not English (Jackson & Lu, 1992).
King and Friesen (1972) found 31 children who were reading upon entry into kindergarten.
Some of these children undoubtedly learned to read with adult assistance,
but there is no evidence that they did so with any systematic, formal instruction usually
associated with school—phonics training, phonemic awareness exercises, and
other skill-building activities.

Some common characteristics have been identified in the backgrounds of these 
precocious readers. First, as Jackson (1988) comments, parents of precocious read¬ 
ers “rarely give the impression of having pushed children toward goals that the chil¬ 
dren did not share” (p. 203); that is, children were not pressured to read before they 
felt ready. Second, precocious readers were usually given extensive elaborative assis¬ 
tance by being read to by their parents (Jackson, 1988; Plessas & Oakes, 1964; Teale, 
1978; Torrey, 1979). Of course, these same activities may have led to the phonological
insights that are thought by some to be critical to learning to read (Goswami &
Bryant, 1990). Evidence also exists that early readers receive certain simple types of
metalinguistic assistance, such as learning the names of letters (Briggs & Elkind,
1977), but this, again, seems to be much less extensive than the systematic instruc¬ 
tion typically advocated for early literacy instruction. The case of Max reported in 
Henderson et al. (1988) may be an instance of this, where watching a letter-game on 
T.V. and having his mother spell words for him was sufficient assistance to allow him 
to recode. But since this assistance occurred within the context of considerable elabo¬ 
rative help, it is difficult to say whether it was necessary or merely helpful. Third, 
early readers come from environments where they saw how text conveyed meaning, 
including seeing adults who engaged in reading and being continually exposed to 
new books in the home (Brenna, 1995). Finally, early readers usually show a strong 
desire to read and like to read independently for pleasure (Forester, 1977; Brenna, 
1995). Given proper conditions of a supportive environment, some children clearly 
can and do learn to read without being explicitly taught in formal settings, and in 
many cases, with little assistance other than being given easy access to print and be¬ 
ing read to by others. 1 

All this is not to say that all children would be equally successful in similar en¬ 
vironments with abundant print access and elaborative assistance to help make texts 
comprehensible. But the option should at least be considered. Poor readers often re¬ 
ceive significantly less exposure to print and instead are given intensive meta¬ 
linguistic assistance in remedial programs (Allington, 1983), in some sense the 
opposite of what successful readers experience. As we will learn in the next chapter, 
these attempts to offer large amounts of metalinguistic assistance have had only 
modest results in improving real reading, and there is still little evidence that even 
these modest gains are long-term. 

Age of Learning to Read 
and Later Reading Achievement 

While it is clear that some young children can and do learn to read without formal
instruction, there remains the related question of whether learning to read by a cer¬ 
tain age per se provides children with longer-term benefits in reading achievement. 
Two types of research design have been used in studies that attempt to answer this 
question. It is important to distinguish between the two and the kind of information
each design can give us. The first type is a nonexperimental, “static group” design 
(Campbell & Stanley, 1966), where the researchers take a group of children already 
reading by the age of school entry and compare them to a group of children who are 
nonreaders. The two groups’ reading scores are then compared over a period of years 
in order to determine the effects of the age at which reading began on eventual read¬ 
ing achievement. 

Nonexperimental Designs 

At least four studies have made such comparisons, and all four have found that early 
readers maintain their advantage over nonreaders into the later years. Durkin’s 
(1966) investigation, discussed above, of 30 early readers and 30 nonreaders at school 
entry found that the early readers were still reading at a higher grade level than their 
first-grade nonreader classmates in the third grade. 

Sutton (1969) found similar results with a group of 46 early readers and 59 
nonreaders; the early readers were still better by grade three, although she did not 
analyze her data for statistical significance. Morrison, Harris, and Auerbach (1971) 
looked at 58 early-reading first graders in New York, all of whom outscored the nonreader
first-grade children through the third grade. The differences in reading comprehension
at grade three were significant.

Tobin and Pikulski (1988) tracked a group of early readers and nonreaders 
through grade six, and reported that the early readers continued to score significantly 
higher than the nonreaders in a number of standardized measures. At the end of 
sixth grade, the early readers did better on the Gates-MacGinitie Reading Tests (total
reading mean scores: early readers = 630.8 [40.1], regular readers = 582.6 [37.2];
calculated effect size from test statistic = 1.2), as well as on informal measures of oral 
reading, reading speed, mean instructional level, silent reading, and comprehension 
questions. All of these differences were statistically significant with moderate to large 
effect sizes. 
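
As a rough check on the size of the Gates-MacGinitie difference, a standardized mean difference can be sketched directly from the reported means. This is only an illustration under two assumptions not stated in the text: that the bracketed values are standard deviations, and that the two groups were of about equal size (the group sizes are not given here). The effect size of 1.2 cited above was calculated from the test statistic, so this is a different route to a similar figure, not a reproduction of that calculation. The function name `standardized_difference` is hypothetical.

```python
import math

def standardized_difference(mean_a, sd_a, mean_b, sd_b):
    """Mean difference divided by the root-mean-square of the two SDs.

    Assumes roughly equal group sizes (an assumption; sizes are not reported).
    """
    pooled_sd = math.sqrt((sd_a ** 2 + sd_b ** 2) / 2)
    return (mean_a - mean_b) / pooled_sd

# Means and bracketed values as reported for Tobin and Pikulski (1988);
# treating the bracketed values as standard deviations is an assumption.
d = standardized_difference(630.8, 40.1, 582.6, 37.2)
print(round(d, 2))
```

Under these assumptions the result lands a little above 1.2, which is in the range conventionally described as a large effect.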

There is a problem with using the evidence provided in these studies, however. 
When using static group comparisons, a risk exists that factors other than the one 
being studied may account for the differences between the two groups. In the case of 
the early readers, it may be that some factor other than the age at which they began to 
read is the cause of the difference between reading scores in the later grades. 

A closer examination of the early reader groups in the studies reviewed above 
reveals the presence of such factors. In Durkin (1966), for example, four of the five 
selected case studies of one early reader group found that their homes had a large 
number of books. Since exposure and access to print is a strong predictor of even¬ 
tual literacy development (Krashen, 1993), it is likely that the continued gains made 
by these early readers are attributable to the environment of the home over the years
studied, and not to the age they began reading per se. Sutton reported that the me¬ 
dian level of the father’s schooling in her early reader group was 15 years, compared 
to 12 years for the nonreader fathers. Morrow (1983) found that children in the 
homes of better-educated parents showed more interest in books and had more
books in the home. While education is only a rough proxy for the print environment 
in the home, it is at least plausible that the early reader/nonreader groups differed in 
their access to books. Tobin and Pikulski (1988) and Morrison, Harris, and Auerbach
(1971) provided no data on the home print environment of their subjects, but given 
the evidence on the literacy backgrounds of other early readers noted by Jackson 
(1988) and Plessas and Oakes (1964) above, it again seems reasonable to suppose that 
there were differences in access between their groups as well which help account for 
the long-term success of the early readers, rather than attributing this to the age at 
which they began to read. 

Experimental and Quasi-Experimental Designs 

A second and more reliable study design is an experimental or quasi-experimental 
one, where children are (ideally) assigned randomly to two groups, or at least 
matched according to certain important variables. One group receives early instruc¬ 
tion in reading and one does not. In this way we can see whether the age at which 
reading begins itself is responsible for the long-term differences in reading develop¬ 
ment. Table 4.1 summarizes the results of four such studies, all of which found that 
the gains made by readers who begin to read “early” are short term in nature and/or 
have little to no long-lasting effects on reading proficiency. Since each study used different
standardized measures and reported its results with different sample statistics,
I have converted the data, when available, into effect sizes to make a more
general comparison of the results (Light & Pillemer, 1984). Effect sizes allow us to
compare findings of the studies using a common metric. 
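
To make the idea of a common metric concrete, the sketch below shows conversions of the kind commonly given in introductory meta-analysis texts for turning a t, an F, or a correlation into a standardized effect size. Whether these are exactly the formulas applied to the studies in Table 4.1 is an assumption; the function names and the sample values are hypothetical and are not taken from any of the studies.

```python
import math

def d_from_t(t, df):
    """Effect size from an independent-groups t statistic."""
    return 2 * t / math.sqrt(df)

def d_from_f(f, df_error):
    """Effect size from an F statistic with one numerator degree of freedom.

    With two groups F = t**2, so this matches d_from_t in magnitude;
    the sign (which group did better) must be read from the means.
    """
    return 2 * math.sqrt(f) / math.sqrt(df_error)

def d_from_r(r):
    """Effect size from a correlation coefficient."""
    return 2 * r / math.sqrt(1 - r ** 2)

# Hypothetical inputs, for illustration only.
print(round(d_from_t(3.0, 58), 2))
print(round(d_from_f(9.0, 58), 2))
print(round(d_from_r(0.5), 2))
```

Because all three functions return values on the same scale, results reported with different sample statistics can be set side by side.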

In Bradley’s (1956) study, two groups of 31 first graders were matched on sev¬ 
eral measures, including the father’s socioeconomic status. One group received in¬ 
struction at the beginning of first grade, according to normal school practice. The 
second group was divided into three subgroups according to what the teacher deter¬ 
mined was their “reading readiness.” These three groups began to receive reading in¬ 
struction when the teacher believed them ready; the first subgroup at five months 
after school entry, the second at eight months, and the third at ten months after 
school entry. The early reader group did better during the initial tests given two 
months into grade two, but those differences disappeared by the ninth month of that 
same school year, with the “delayed” readers actually doing slightly better at the end 
of grades two and three. Note in Table 4.1 that the effect size of the early reading 
treatment declines steadily throughout the three years, from +1.8 to -.67. 

Gray and Klaus (1970) studied 61 at-risk children who were randomly as¬ 
signed to three groups: One group participated in a ten-week summer reading and 
reading readiness program each of the three summers prior to the group’s entry into 
first grade, another group attended summer school two summers prior to first grade, 
and the third group received no special instruction. The differences among the three 
groups were significant only in the second grade, with the control group doing a bit 
better than both groups in the first grade and actually outscoring the group that received
the earliest instruction in all three grades. It should be noted, however, that
despite the early intervention, there is no indication that the experimental group was
actually reading by the beginning of first grade, and therefore may not have been a
true case of early readers.

Table 4.1

Experimental Studies of Early Readers

                         GRADE READING      SIGNIFICANT DIFFERENCE
                         SCORES             FAVORING                   EFFECT
STUDY                    MEASURED (a)       EARLY READERS?             SIZE (b)

Bradley (1956)           November, 2nd      Yes                        +1.8
                         June, 2nd          No                         -.06
                         June, 3rd          No                         -.67

Gray and Klaus (1970)    June, 1st          No                         n/a
                         June, 2nd          Yes                        n/a
                         June, 3rd          No                         n/a

Durkin (1974-1975) (c)   September, 1st     Yes                        +.86
                         March, 2nd         Yes                        +.59
                         March, 3rd         No                         +.34
                         March, 4th         No                         +.50

Hanson and Farrell       Fall, 12th         Yes                        +.10
(1995)

(a) Standardized reading comprehension scores were used for all studies.
(b) A positive value favors the early reader group. Effect sizes were determined according
    to formulas provided by Wolf (1986) from the sample statistic used in the original study.
(c) Effect size calculated from the mean reading raw scores reported by Durkin.

Durkin (1974-1975) compared two groups (total of 33) of four-year-olds ran¬ 
domly assigned to two treatment groups, one that received special reading and read¬ 
ing readiness instruction, and one that attended a normal kindergarten program. 
Her six-year study showed that while the early reader group did better in the first and 
second grades, no significant differences were present by grades three and four. While 
the early reader group still outscored the nonreader group at the end of the study, 
the advantage was marginal, with the effect sizes exhibiting an overall declining trend 
as the children got older. 

Finally, Hanson and Farrell (1995) looked at a large sample (n = 3,959) of high 
school seniors, about two-thirds of whom had participated in a kindergarten early- 
reading program. The experimental program, the Beginning Reading Program 
(BRP), was carried out during the school year 1972-73 in 24 of the 25 school districts
that took part in the longitudinal study. The researchers gave measures of reading
comprehension to all those who were high school seniors in these same districts.
The combination of those students from the non-BRP district plus those students
who transferred into the BRP districts after kindergarten gave the experimenters a 
control group with whom to compare the students who learned to read in the BRP 
program. The design of the study was posttest only—that is, only the reading scores 
from grade twelve were used to compare the possible effects of the BRP treatment— 
and assignment of the students to either the experimental or control groups was, of 
course, not random. Hanson and Farrell found that those students who were in the 
BRP program scored significantly higher than those who were not (F = 6.2 [degrees 
of freedom not given], p < .001). 

While the large number of subjects made the difference in mean scores statis¬ 
tically significant, the difference in performance between the early and regular read¬ 
ing groups was actually quite small, with an effect size of only .10, making the 
practical significance of the results questionable. There are also some plausible rea¬ 
sons for the difference in scores that may help account for even this slight advantage 
attributed to the early reading program. Those districts that elected to use the BRP 
may have other institutional qualities besides the kindergarten program that may 
have affected reading performance, such as better reading programs at the upper lev¬ 
els and/or better libraries that in turn could have affected senior high reading scores. 
It seems likely that districts that self-selected into the BRP project made other special
efforts to assist students’ reading performance compared to those districts that
did not participate in the BRP, making any claims about the effects of the kindergar¬ 
ten program on long-term results, at the very least, somewhat suspect. 
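
The gap between statistical and practical significance can be illustrated with a small sketch. It computes the t statistic implied by a fixed effect size of .10 for two different sample sizes. The split of the 3,959 seniors into 2,639 and 1,320 is an assumption based only on the statement that about two-thirds took part in the program, and the small sample of 40 and 20 is entirely hypothetical. The sketch does not attempt to reproduce the F value reported by Hanson and Farrell.

```python
import math

def t_from_d(d, n1, n2):
    """t statistic implied by a standardized mean difference d
    for two independent groups of size n1 and n2."""
    return d * math.sqrt(n1 * n2 / (n1 + n2))

# Same effect size, very different sample sizes (group sizes are assumptions).
t_large = t_from_d(0.10, 2639, 1320)
t_small = t_from_d(0.10, 40, 20)

print(round(t_large, 2), round(t_small, 2))
```

With the large sample the implied t clears the conventional two-tailed .05 threshold of roughly 2, while with the small sample it falls far short, even though the difference between the groups is equally slight in both cases.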

From the above studies there appears to exist what we may term a “Tortoise 
and Hare Effect” for age and learning to read: Children who learn to read early in 
experimental studies do better starting out, but the gains made are negligible and 
insignificant by as early as the third grade. This is consistent with cross-national 
comparisons that examine age of initial reading instruction and later achievement. 
Reporting on data from a cross-national comparison of reading, Elley (1992) states 
that countries that begin reading instruction at ages five and six have some initial 
advantages, but that those children who began “late” as seven-year-olds have usually 
caught up by age nine. It is important to note that those countries included in Elley’s 
survey that delayed instruction until age seven were economically well-off and re¬ 
ported an ample supply of books in the home, suggesting that access to print, not age, 
is the critical ingredient in achievement (Krashen & McQuillan, 1996).

Early Reading and Later Success 

While learning to read by a certain age is not in and of itself critical, there is a relationship
between the level of reading performance in first grade and in later years. Several
studies (Juel, 1988; Lundberg, Frost, & Petersen, 1988; Shaywitz et al., 1995) have
found that children who are poor readers in the early years tend to be poor readers 
in later grades (up to grade six in one study). These studies use “static group” designs, 
however, and as such are not able to isolate age of independent reading as critical to 
long-term development. What the evidence reviewed here suggests is that learning 
to read before a certain age, all other factors being equal, does not in and of itself confer
any lasting advantages in later grades. 2

The logic of early “intervention” or programs aimed specifically to raise read¬ 
ing scores seems to be that if a child is put at grade level early, he will be able to “stay 
up” with school reading and will read for pleasure in the quantity needed for normal 
vocabulary and comprehension growth. The reading problem, that is to say, is essen¬ 
tially located within the child (and indicates a lack of metalinguistic knowledge). At 
the very least it is thought that changing the child is necessary, perhaps even suffi¬ 
cient, to maintaining long-term reading success. If, however, many (most?) reading 
difficulties are not within the child, but are due rather to a lack of access to compre¬ 
hensible reading materials in a supportive environment, then bringing a child up to 
grade level her first year of schooling will do little in the long run. As we will see in 
Chapter 7, poor readers are very often in precisely this situation. By the time 
remediated readers are in the middle grades, they may very well be back to (rela¬ 
tively) low levels of achievement. Unfortunately, such longitudinal research in edu¬ 
cation is rare, and we have relatively little data to go on in judging the long-term 
success of most intervention programs. 3 Krashen’s (1997a) analysis of studies on the
long-term effects of phonemic awareness training shows that early
metalinguistic training does not, in fact, lead to sustained gains in later grades. To be
sure, more information is needed on the longitudinal effects of early intervention,
especially if a significant amount of resources is to be devoted to it.

Sound and Reading


The confusion over the role of “phonics” in reading has never been greater, both
in the popular press and in the scientific literature. On the one hand, some researchers
have claimed that training children in “phonemic awareness” and
sound-letter correspondences is critical to reading success (e.g., Share, 1995). Frank
Smith, on the other hand, in his classic volume Psycholinguistics and Reading (1973), 
called the supposed importance of sound in reading “the Great Fallacy.” What do the 
sounds of letters have to do with reading? Is knowing how sounds go together in spo¬ 
ken speech critical in learning how to read? What does the relationship between such 
“phonological awareness” and reading ability mean for learning how to read? These 
are some of the issues we will address in this chapter. 


What Can Sound Do for You? 

We noted in Chapter 2 that knowing the sounds that letters make, whether garnered 
through metalinguistic instruction or the experience of being exposed to print, is one 
of the things we can use to make sense of text. Knowing how sounds are linked to 
letters can help a reader figure out the meaning of a word by reducing the possibili¬ 
ties of what an unknown word could be. For example, if a child is reading a story 
about a lion, and comes across the unknown word lair, knowing what sound the letter
l usually makes at the beginning of the word allows him to reduce the number of
possibilities, eliminating such possibilities as cage or house. This knowledge of 
sound-letter correspondence may be even more useful when you have been given few 
or no clues provided by other parts of the text, as is the case when a reader encoun¬ 
ters unknown words in a list of random words, or, as is done in many studies, is given 
a made-up or nonsense word. 
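
The narrowing described in the lion example can be sketched in a few lines. The candidate lists below are hypothetical guesses a child might make from the story context (the words "den" and "ledge" are added here for illustration), and the function name is likewise an invention. The point is only that first-letter knowledge removes some guesses without necessarily settling on one.

```python
def narrow_by_first_letter(candidates, printed_word):
    """Keep only the guesses that start with the same letter as the printed word."""
    first = printed_word[0].lower()
    return [word for word in candidates if word.lower().startswith(first)]

# Hypothetical guesses suggested by the story context.
remaining = narrow_by_first_letter(["lair", "cage", "house", "den"], "lair")
print(remaining)

# When several plausible guesses share the first letter, the letter alone
# reduces the possibilities but does not pick out a single word.
still_open = narrow_by_first_letter(["lair", "ledge", "cage"], "lair")
print(still_open)
```

In the second call two candidates survive, so the reader must still rely on meaning to choose between them.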

Yet this is very different from saying, as has been done by several researchers, 
that knowledge of how sounds relate to letters, or how words can be broken down 
into their constituent parts or “phonemes” in speech, is a central process in reading, 
either at the early stages or for fluent reading. Part of the confusion arises from the
mistaken belief that somehow speech is immediately and instantaneously
“decodable” into meaning, that the mere sounds of spoken speech are directly connected
to a word’s meaning. This is, as Goodman (1971) noted long ago, false: Both
speech and writing are “codes” and neither is immediately or intrinsically related to 
the meaning they convey. Both the sounds of speech and the letters of written lan¬ 
guage are “surface” structures of language, which we assign rather arbitrarily to con¬ 
cepts or meaning. Thus, converting letters to sounds—often referred to as 
“decoding”—is not sufficient to allow text to make sense. Meaning must still be 
brought to the spoken word every bit as much as the written word. 

In fact, most words cannot be decoded letter by letter with any accuracy with¬ 
out having some idea beforehand of the word you are trying to pronounce. At least a 
rough guess at the meaning of a word is often needed to pronounce any given letter 
with accuracy. Try to pronounce the beginning of this word: 

r-e-c... 

Did you pronounce the e long as in recent or short as in recommend? Is the c pro¬ 
nounced hard as in recluse or soft as in receive? You would need to know the rest of 
the word to make such decisions, which means already having some idea of what the 
word is. Knowing how letters are pronounced can help reduce the possibilities, but 
it typically cannot do the job alone. 1 

This logical problem with the use of sound in reading goes to the heart of the 
often-heard practical rationale for “phonological processing,” that if you’ve heard a 
word but haven’t seen it in print, then “recoding” it from print to sound would be 
very helpful. In fact, at best we usually only use our knowledge of the sounds that 
letters or clusters of letters make for eliminating possibilities, rather than identifying 
precise pronunciations. Children and adults do not, in fact, typically attempt to de¬ 
code every word to sound when they read, as demonstrated by a series of studies by 
Ken and Yetta Goodman and their colleagues (see Gollash, 1982; Goodman, 1996). 
Readers often make “miscues” or mistakes that make sense in the context of the pas¬ 
sage they are reading, because they are focused on meaning first and foremost (see 
also Weaver, 1994, for a discussion of miscues in reading). 

Some Empirical Evidence on Sound in Reading 

Several researchers, such as Stanovich (1986, 1993) and McGuinness (1997), have 
attempted to argue that knowing certain elements in the relationship between sound 
and written language is critical to fluent reading and to learning how to read. An 
enormous number of studies have been conducted and continue to be published 
guided by these assumptions, and recently, reading policy in many states (notably
California) is quite literally dictated by them. Does recent evidence on the role of
phonological processes in reading contradict Goodman’s and Smith’s Model? 

The Goodman-Smith view of reading predicts that knowledge of sound-letter 
correspondence will make a small contribution to beginning reading, since those who 
have such knowledge will be able to use it to help them figure out unknown words, as
in the “lair” example given above. In the following sections, I will argue that the evidence
to date is still consistent with that view. Since the number of studies on the topic
of sound and reading is far greater than the amount of space available here, only
some of the key claims made, along with a few illustrative studies, will be discussed. 2 

Good Readers Have Better Phonological Processing 
Than Poor Readers 

One type of evidence thought to disprove the Goodman-Smith Model is the well- 
established finding that good readers tend to have better sound or “phonological” 
awareness than poor readers (Stanovich, 1993). Good readers, for example, are usu¬ 
ally better than poor readers at pronouncing “nonsense” or made-up words (e.g., 
grack or wug), performing “phoneme deletion” tasks (i.e., pronounce a word with 
certain sounds deleted, such as trick without the r sound), and similar artificial word 
games. Thus, it has been argued, phonological factors must be critical in reading, or 
at least in distinguishing good readers from poor readers. 

Good readers, without question, typically do better than poor readers on many 
of these phonological tests. Juel’s (1988) study of good and poor readers in fourth
grade provides a clear case of this, as shown in Table 5.1. Poor readers scored signifi¬ 
cantly lower than good readers on tests not only of “real” reading (reading compre¬ 
hension), but also on a test that focused on sounding out words, or decoding. 
Suppose, however, that phonological abilities were themselves largely a product of 
reading and reading experience. The same correlation between phonological ability
and reading would be equally consistent with that hypothesis. As Krashen
(1998b) has observed, the relationship could also be as shown in Figure 5.1, refer¬ 
ring specifically to the type of phonological skills considered most critical by many 
researchers, phonemic awareness. 

In Chapter 3 we reviewed some of the evidence supportive of this relationship: 
Children begin with certain concepts about print through exposure to comprehen¬ 
sible written language (environmental print), then begin to identify words in prop¬ 
erly specified contexts, and eventually develop phonemic awareness of the sort that 
allows them to “decode” real words out of context or the nonsense words used to detect
phonological processing abilities. If this relationship is correct, then the superior
phonemic awareness of good readers is not necessarily causal: Phonemic and other
forms of phonological awareness, as well as a good deal of sound-letter correspondence
knowledge, may largely be the result of experience with print. This knowledge
may then be helpful to some children once they begin reading, since knowing the
sound properties of print may in turn make some text more comprehensible (again,
as in the “lair” example above). Thus studies that find better readers have better phonemic
awareness could just as well be interpreted in favor of the Goodman-Smith
view of reading.

Table 5.1

Reading Comprehension Scores and Decoding Ability of Fourth-Grade Readers

                           GOOD READERS     POOR READERS

Reading Comprehension      5.9              3.5
Decoding                   42.6             28.2

(from Juel, 1988, Table 1, p. 441)

Figure 5.1

The Relationship Between Reading Experience and Phonemic Awareness

Reading Experience  -->  Phonemic Awareness

Phonemic Awareness Predicts Reading Growth 

In addition to studies that have found good readers to have better phonemic
awareness than poor readers, there have been longitudinal studies that have found that
children’s level of phonemic awareness in early grades is sometimes (but not always)
related to reading success in later grades. Juel, Griffith, and Gough (1986), for
example, found that a measure of phonemic awareness in October of first grade
correlated moderately (r = .56) with a measure of second-grade reading comprehension.
But this staggered or “cross-lagged” correlational evidence (relating performance on
a test at one point in time with performance on another test several months or years
later) does not necessarily indicate that the strength of phonemic awareness in
kindergarten or first grade is the cause of higher levels of reading comprehension in later
grades, as has often been concluded. This same evidence is also consistent with the
notion that reading experience is the primary source of both reading ability and
certain types of phonemic awareness for many readers.

We have hypothesized that phonemic awareness is largely a product of reading
experience, and that reading experience is a consequence of a “print-rich”
environment, one in which reading materials are easily accessible. It stands to reason that
children who have more exposure to written language in, say, the first grade,
resulting in higher levels of phonemic awareness, are also going to have higher levels of
print exposure and reading experience in subsequent grades, resulting in both more
phonemic awareness and better reading comprehension in later years. This assumes,
of course, that phonemic awareness is always or typically correlated with higher
reading ability. As we shall see, this is not always the case. At times, the size of the
relationship between reading ability and phonemic awareness appears to be rather
modest among some readers, again indicating that phonemic awareness may be
neither a critical element in reading achievement nor a cause of reading acquisition.
Reading with Poor Phonemic Awareness?

From the evidence above, it would appear that many forms of phonemic awareness
do not develop prior to alphabetic reading. It is also possible to find children and
adults who are good readers in English but have low levels of phonemic awareness.
For some researchers, these concepts appear contradictory: A good reader is in part
very often defined as one having good phonemic awareness. But if we use the term
“good reader” in the everyday sense, as one who can understand a passage of
appropriate difficulty, then there is indeed evidence that one can be a good reader with
poor phonemic awareness.

Bruck (1990), for example, found a subgroup of adults who had normal
reading comprehension levels but poor phonemic awareness. As shown in Table 5.2, these
“dyslexics” did just fine in reading comprehension but poorly on isolated or
nonsense word tasks. They committed ten times as many errors on nonsense word
pronunciation, and were nearly three times more inaccurate in one task of isolated word
identification. Yet they could read, in the meaningful sense of the word, just fine.

Similar findings for adult “dyslexics” were found by Elbro, Nielsen, and Petersen 
(1994). Their sample of adult dyslexics scored no worse than an age-equivalent group 
on reading comprehension accuracy (97.7 percent vs. 98.8 percent) but significantly 
lower on tests of nonsense word reading (67.5 percent vs. 95.9 percent accuracy). The 
only difference in real reading found was in speed: The “dyslexics” were slightly slower 
readers than the controls, a finding that is consistent with the results of a study of 
“dyslexic” medical students by Banks, Guyer, and Guyer (1995). 

Children who are good comprehenders but poor “decoders” have also been
studied (Stothard & Hulme, 1996). Some children who are high in phonemic
awareness have been found to be low in reading comprehension; some show the opposite
pattern. Marshall and Cossu have also conducted several studies of both children and
adults who have normal reading comprehension (or comparable to controls) but
show virtually no ability to perform phonemic awareness tasks, including both
“hyperlexics” and children with Down’s Syndrome (Marshall & Cossu, 1991; Cossu,
Rossini, & Marshall, 1993; see also Campbell, 1991). All this leads to the conclusion
that real reading (reading passages for comprehension) is not to be confused with
its sometime correlate, phonemic awareness.


Table 5.2

“Dyslexic” and Normal College Readers

                                         “DYSLEXIC”     NORMAL
                                          READERS       READERS

Reading Comprehension (Percentile)          76            79

Nonsense Word Pronunciation Errors         11.71          1.7

High Frequency/Exception Isolated
Word Identification (Error Rate)            1.57           .45

(from Bruck, 1990, Tables 1, 2, 3, and 7)

Children Need to Be Trained in Phonemic Awareness 

Currently massive efforts are being made in many states to “train” students in
phonemic awareness in the hopes that this will produce appreciable gains in long-term
reading scores. California has led the way with some of the most drastic changes in
curriculum and funding to promote such efforts (e.g., California Reading Task Force
Report, 1995). It is by no means clear, however, that these efforts will pay off,
judging from the current research evidence.

Training Appears to Have Strong Effects on Phonemic 
Awareness, Weak Effects on Real Reading 

It is important to keep in mind that the goal of reading instruction is to improve
what we normally associate with reading: comprehension of passages. As noted
above, reading comprehension and phonemic awareness do not always go hand in
hand. It is not surprising to find, then, that studies in which children are explicitly
trained in phonemic awareness show huge leaps in phonemic awareness, but often
only modest to small gains in reading comprehension. Krashen (1997a, 1998a), for
example, computed effect sizes for four frequently cited studies of phonemic
awareness training (Ball & Blachman, 1991; Cunningham, 1990; Lie, 1991; Lundberg,
Frost, & Petersen, 1988), and found that while the gains in phonemic awareness were
large (effect sizes .88 to 2.26), the impact on reading comprehension was modest
(effect sizes ranging from .28 to .57). Even these gains on “reading” are probably
somewhat inflated, given that three of the four studies Krashen analyzed used reading
measures consisting, in whole or in part, of individual isolated words. Lundberg et
al. (1988), for example, which is widely cited as showing the superior effects of
phonemic awareness training, used a 400 isolated word-picture matching test as a
measure of real reading, not connected text or passage comprehension. Cunningham
(1990) used three combined measures to test reading, including at the primer and
primary levels measures of “sound-symbol correspondence, word recognition, and
reading comprehension” (p. 432), so it is difficult to say how the training affected
reading comprehension per se. Ball and Blachman (1991) used only isolated word
identification tasks. Lie (1991) was the only study to use a test measuring exclusively
the comprehension of connected text.

Other phonemic awareness studies find no gains in any measure of real
reading, even when the amount of training is substantial. Torgesen and Hecht (1996), for
example, report on the first year of a five-year project comparing different types of
skills training for children identified as having low scores on phoneme deletion and
letter-naming tasks in kindergarten. Each treatment group received 80 minutes of
one-on-one tutorials each week during kindergarten and first grade. The children
were randomly assigned (n = 200) to one of four groups: “embedded
phonics,” “phonological awareness (PA) plus synthetic phonics,” a regular tutorial
group (no explicit phonemic awareness training), and a control group. At the end of
first grade, tests of various phonological abilities (blending phonemes together,
deleting phonemes, and pronouncing nonsense words [“word attack”]) and reading
comprehension were administered. Table 5.3 includes selected results.


Table 5.3

Comparison of Phonemic Awareness Training Groups

                                                     PA &
                          NO          TUTORIAL       SYNTHETIC     EMBEDDED
                          TRAINING    GROUP          PHONICS       PHONICS

Blend Phonemes            11.4 (5.4)  13.0 (5.5)     14.9 (3.6)    13.1 (4.0)

Word Attack                2.6 (3.5)   5.8 (6.7)      9.6 (7.8)     5.4 (5.1)

Reading Comprehension      7.3 (6.7)  11.4 (7.1)     10.7 (7.5)     9.6 (6.5)

(from Torgesen & Hecht, 1996, Table 6.1, p. 151)

Consistent with other phonemic awareness training studies, the Phonemic
Awareness-trained groups performed substantially better than the No Training
group on phonemic awareness tests, but not on reading comprehension. In the word
attack test, the effect size for the Phonological Awareness plus Synthetic Phonics
group versus the No Training group is large (2.0). However, note that on other
phonemic awareness measures, such as the “blend phonemes” task, the difference
between the PA plus Synthetic Phonics group and the No Training students was not as
great (14.9 versus 11.4, effect size = .83). For reading comprehension, the difference
between the two groups was not even statistically significant. It is also interesting to
note that the No Training group received no tutorial help in kindergarten or first
grade, yet seemed to be progressing in at least some aspects of phonemic awareness
without instruction, as in other studies (Juel, 1988).
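
The word attack effect size quoted above can be approximated from the figures in Table 5.3. The sketch below is only an illustration under stated assumptions: it treats the parenthesized values in the table as standard deviations, and it takes the effect size to be the difference in group means divided by the No Training group's standard deviation. The text does not say which formula was used, so other choices (a pooled standard deviation, for instance) would give different values, and results for other rows need not match the figures reported in the text. Under these assumptions the word attack comparison works out to 2.0.

```python
# Illustrative only: assumes Table 5.3 reports mean (standard deviation)
# and that effect size = (treatment mean - control mean) / control SD.
# Neither assumption is confirmed by the text.

def effect_size(treatment_mean, control_mean, control_sd):
    """Standardized mean difference scaled by the control group's SD."""
    return (treatment_mean - control_mean) / control_sd

# Word attack: PA & Synthetic Phonics group vs. No Training group
word_attack = effect_size(9.6, 2.6, 3.5)

# Reading comprehension: the same two groups
reading_comp = effect_size(10.7, 7.3, 6.7)

print(f"Word attack effect size: {word_attack:.2f}")
print(f"Reading comprehension effect size: {reading_comp:.2f}")
```

On the same assumptions, the reading comprehension difference is far smaller than the word attack difference, which is the contrast the paragraph above draws.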

What is most important about these results is that there were no significant
differences among the treatment groups on any of the five measures other than the word
attack test. That is, those groups of children who received large doses of explicit,
systematic phonemic awareness training did no better on most tasks than those who
received no training at all in phonemic awareness. Most disappointing is the fact that
none of the children who received extensive tutoring did significantly better in
reading comprehension than the control children, who received no help at all.

Even in those few studies where phonemic awareness does have an effect, the
gains appear to be short-lived. Byrne and Fielding-Barnsley (1993) tracked a group
of children who had received 12 weeks of phonemic awareness training in
kindergarten for a one-year period. By the time the children were in first grade, there were
no differences between the trained and untrained groups on isolated word
identification or spelling, and only a small advantage for the previously trained group in
nonsense word reading.
Table 5.4

Growth of Phonemic Awareness (Segmentation) Among Seven-Year-Olds
(Pre- to Posttest Gain Scores)

                                 TRAINED GROUP        UNTRAINED GROUP
                                 (PHONOLOGY ONLY)     (READING ONLY)

Sound Deletion                        9.90                 3.25

Nonsense Word Segmentation            7.23                 5.81

Sound Blending                        6.86                 4.74

Sound Categorization                  3.67                 3.09

(from Hatcher et al., 1994, p. 49, Table 2)

Phonemic Awareness Grows without Instruction

Little question exists that certain aspects of phonological awareness grow on their
own without any explicit, metalinguistic instruction, and that this growth appears to
be largely a result of reading itself. Hatcher, Hulme, and Ellis (1994) report the
results of a longitudinal study of a group of seven-year-olds placed in various
instructional treatment groups for 30-40 minute sessions each day over a 20-week period,
ranging from “reading alone” to “phonology alone.” Consistent with the studies
analyzed by Krashen (1997a), those children who received no instruction in phonemic
awareness got better, as seen in Table 5.4.

Despite the fact that they received no formal instruction in phonemic
awareness, the reading group made clear gains in all areas of phonological awareness and
processing, although the “phonology only” group was superior on most measures.
This growth can only be linked to reading, since the researchers took careful
precautions to ensure that there was no “explicit mention of phonology ... [or] teaching
strategies explicitly concerned with phonological linkage activities” (p. 48). Teachers
were “frequently reminded” of this during the experiment. Correlational evidence
also corroborates that phonemic awareness grows in most children as a consequence
of reading and instruction, without programs to teach it. Bentin, Hammer, and
Cahan (1991) found an “age” effect in the growth of phonemic awareness among
their kindergarten subjects, even prior to any formal instruction in reading (though
not, as they point out, unrelated to experience with print outside of school and in
the home) (pp. 273-274).

Is Phonological Knowledge Really a Consequence
of Reading Experience?: Some Challenges

Wagner and Stanovich, summarizing a series of five studies, claim that the
relationship between free reading (reading done for pleasure) and phonological processing
is “quite small” (1996, p. 214). An examination of these five studies reveals, however,
that this outcome may very well be an indication that the type of phonological
awareness (if any) that is needed for real reading is itself rather small. The test of free
reading was similar in all of these studies, consisting of a short checklist test that has
been found to be a good indicator of out-of-school reading experience. The
measures of phonemic awareness and phonological processing varied, from phoneme
deletion (dropping a sound from a spoken word, such as saying trick without the r
sound: tick) to the reading of nonsense words. Two studies, Cunningham and
Stanovich (1990, 1993), had no measures of reading comprehension, and another
(Barker, Torgesen, & Wagner, 1992) had only a silent reading rate of connected text,
not reading comprehension. Table 5.5 summarizes the correlations among free
reading, reading comprehension, and phonological measures from studies cited by
Wagner and Stanovich (1996), adding McBride-Chang, Mannis, Seidenberg,
Custodio, and Doi (1993), which also addresses this issue.

A fairly consistent pattern exists across studies. The highest correlations are
always between free reading and reading comprehension, ranging from .32 to .58,
consistent with the view that more reading leads to more reading ability. The size of
the relationship between free reading and the phonological task is always much
smaller (range: -.04 to .35), but so, too, is the phonological task’s relationship to
reading comprehension (range: .05 to .36). Recall that our hypothesis is that more reading
leads to more reading ability, and to the extent that phonemic awareness is related to
reading, it will be largely a consequence of reading ability. If, however, certain
measures of phonemic awareness are not strongly correlated with reading, then it follows
that neither will there be a strong correlation between phonemic awareness and
reading amount. This is precisely what we observe in the studies listed in Table 5.5: low
to moderate correlations between phonemic awareness and reading comprehension,
and similarly low to moderate correlations between phonemic awareness and free
reading. The lack of a relationship, then, between phonemic awareness and reading
amount is hardly troubling if phonemic awareness is not itself linked to reading
ability. The failure to find a strong and consistent correlation between phonemic
awareness and reading ability is problematic for Wagner and Stanovich, however, since
they hold that phonological abilities are central to reading acquisition.

Other studies attempt to link print exposure directly to variations in
phonological awareness. Raz and Bryant (1990) conducted a longitudinal study of the
relationships among access to print, the use of rhyme, and reading comprehension
among four- and five-year-olds from middle- and lower-income families. They
found that none of the seven individual measures of “home environment,” which
included five indicators of print access, predicted scores on a rhyme detection test
among their four- and five-year-old subjects. The researchers analyzed the effect of
each indicator separately, however, which very likely underestimated the effect of
print on any of their other measures. An alternative explanation for their results is
that no one measure adequately taps the concept of reading experience and exposure
to written language for children, and only a measure that combines the various
possible sources of print experience in the home will detect its impact.

Symons, Szuszkiewicz, and Bonnell (1996) corrected this problem with a more
global test of reading exposure within the household by using a measure of the
parent’s reading experience, the assumption being that this would in turn relate to
how much print exposure their children would have. Unlike Raz and Bryant (1990),
Symons et al. found a significant effect for print on children’s phonological
awareness, even after controlling for socioeconomic variables. It appears, then, that print
exposure is a sum of a number of different variables, and that analyzing each
component separately may miss the cumulative effect of all sources combined, including
number of books, library visits, role models in reading, and other actions
associated with encounters with written language.


Table 5.5

Correlations of Print Exposure, Real Reading, and Phonological Tasks

                                                          READING
                           FREE READING TO                COMPREHENSION      FREE READING
                           PHONOLOGICAL                   TO PHONOLOGICAL    TO READING
STUDY                      TASK                           TASK               COMPREHENSION

Stanovich and West         .27 (phonological choice)      .27                .36
(1989)                     .35 (pseudoword naming)        .28

Barker, Torgesen,          .03 (phoneme deletion)         .21                .42 (silent
and Wagner (1992)          .28 (phonological choice)      .36                reading rate)

Cipielewski and            .31 (phonetic analysis)        .31                .58
Stanovich (1992)

Cunningham and             -.04 (phoneme deletion)        —                  —
Stanovich (1990)

McBride-Chang              .20, .26 (nonword reading)     .05, .36           .32, .64
et al. (1993)
(Disabled Readers,
Comparison)

Cunningham and             .01, .16 (phoneme deletion)    —                  —
Stanovich (1993)           -.04 (phoneme transposition)
Vellutino and his colleagues (1996) also attempted to measure print exposure’s
effects on phonological awareness in their large-scale study (over 1,400 children), but
here the problem was with the test itself. Vellutino et al. employed a “print
conventions” measure to tap print exposure knowledge, which tests children’s familiarity
with written English concepts such as how books are oriented, the top-to-bottom
directionality of print, the meaning of punctuation marks, and so forth. However,
the “normal” readers and the three “low” groups all scored rather close to the
maximum possible (12). Calculating from the standard deviation and effect sizes shown
in Vellutino et al.’s Table 5, we find that the “normal” readers scored almost a perfect
12 (11.18), and the lowest of the low groups was not far behind (9.55). In other
words, because all groups scored near the ceiling on this test, it is unlikely to be able
to differentiate levels of print exposure among children even if variations actually
existed. As we shall see in Chapter 7, when broader measures are taken, substantial
differences in print exposure and access have been found to exist among children of
varying reading abilities. 3


Sound Ideas About Reading 

Sound is not critical to reading, either for fluent readers or for those who are
learning how to read. As Frank Smith and Ken Goodman hypothesized three decades ago,
reading for meaning is not primarily a process of decoding words to sounds. While
knowledge of how sound and writing relate can be helpful to some children to make
sense of text when other strategies fail, the claims that have been made about the
necessity of phonemic awareness and its training for early reading do not appear to
be substantiated by the evidence to date. The often-noted superiority of good
readers in phonemic awareness seems to be largely a product of reading experience. It is
also possible to find good readers who have poor phonemic awareness, and explicit
training in phonemic awareness produces relatively moderate and limited results on
reading comprehension. The failure of some studies to find a strong link between
print exposure and performance on phonological tasks may be due to
methodological problems or perhaps even to the limited impact of those phonological abilities
on real reading.



Context, Eye Movements, 
and Comparisons in Reading 


Although the role of sound and phonics instruction in reading has received
most of the attention in the reading debates of the past fifty years, there are
other, equally important issues involved in identifying what fluent readers do
and how children learn to read. I noted in the proposed model of literacy
acquisition (Chapter 2) that the reception of comprehensible, written language results in
reading acquisition and development. Now I return to the question of how the
comprehension of text actually takes place. How is it that we are able to understand the
written marks on the page? Of equal interest to us, from the practical standpoint, is
the issue of whether certain approaches to helping children learn to read have been
found to be more successful than others. Are methods that emphasize metalinguistic
over elaborative assistance, for example, better than their alternatives? These are the
issues we will take up in this chapter, beginning with a discussion of the disputed role
of “context” in reading.


Making Sense of Print: The Role of Context 

As your eyes scan the page, what do they see? A common perception of reading goes
something like this: We examine each and every letter in sequence, assembling them
all together until we have a word. From there we identify each word we encounter as
we move from left to right in the text, putting these words together to form a
sentence. We then gather up these sentences until we end up with a paragraph, and so
forth down the page. This letter-by-letter, word-by-word approach, like the previous
chapter’s “sound out the word” logic, has a certain popularity, even among some
researchers. But we have good reason to believe that it is not correct.

One of the oldest findings of experimental psychology is that there is a rather
strict limit to the number of random letters we can identify at a single glance, usually
4 to 5 (Cattell, 1885, cited in Smith, 1973). If, however, these letters are arranged to
make two or three unrelated words, we can identify perhaps a dozen letters or more in
the form of words in the same amount of time. And if these words are themselves
arranged to form a phrase or short sentence, we can in effect identify two to three dozen
letters in that exact same instant, only in the form of a meaningful string of words.

How is it possible that under some conditions we can make out only four to
five letters, but under other conditions identify words and phrases consisting of 20
or 30 letters? Quite simply, our prior knowledge of words and how they go together
greatly facilitates the rapidity with which we can identify them. To choose among
random letters means deciding among 26 alternatives at each step. By definition, there
is no pattern to “speed” the process along. But as fluent readers, we know a good deal
about words and how they go together, and we can eliminate in advance certain
possibilities. Most of this knowledge is implicit, of course, not something we typically
think about consciously. We know, for example, that the letter q is usually followed
by the letter u, so that once we see a q we can reduce the number of possibilities for
the following letter down to one. Taken to the next level, a string of meaningfully
combined words (a sentence) takes less visual information to identify than would be
needed to read these exact same words arranged at random. As was pointed out in
Chapter 2, our nonvisual information (what we can refer to as our background
knowledge or “context”) helps us identify words and meaning much faster due to
the elimination of unlikely alternatives.
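
The reduction in alternatives described above can be put in rough numerical terms. The following sketch is an illustration only and is not part of the original argument: it assumes that all alternatives are equally likely and expresses uncertainty in bits, the base-2 logarithm of the number of alternatives. On that assumption a letter drawn at random from 26 carries the most uncertainty, while the letter after q, if only one continuation is possible, carries none.

```python
import math

def uncertainty_bits(alternatives):
    """Bits needed to pick one of `alternatives` equally likely options."""
    return math.log2(alternatives)

# A random letter: any of the 26 letters may come next.
random_letter = uncertainty_bits(26)

# The letter after q: assumed here to have a single possible continuation.
after_q = uncertainty_bits(1)

print(f"Random letter: {random_letter:.2f} bits")
print(f"Letter after q: {after_q:.2f} bits")
```

Real text falls between these two extremes, since prior knowledge rules out some continuations without reducing the choice to one.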

Trying to identify words letter by letter would be much too cumbersome for
fluent reading in any case, causing an overload on what our short-term memory
could handle (Smith, 1973, 1994). Instead, we typically identify words without
resorting to a letter-by-letter analysis, although once identified, we can then break the
word down into its constituent letters if so required. Note that this is not to say that
we fail to physically look at the letters, or that we simply pass over the visual
information contained in them. Smith (1973) noted that one of the keys to our ability to
read at the speed we do is our “feature analysis” of letters and words: We “sample”
from the visual information on the page, examining aspects of letters (perhaps even
something from every letter) without necessarily identifying those letters
individually as we make our determination of the entire word. 1 Similarly, we do not
necessarily identify each word in succession as our eyes scan the page, but rather we identify
the meaning conveyed by those words that make up the phrase or sentence. 2

All this will take place in normal, fluent reading. Of course, sometimes
reading “breaks down,” and we fail to comprehend a phrase or word. When this happens,
we may need to resort to other, less efficient strategies for making sense of print. We
may stop and read more slowly, or reread, trying to identify individual words in an
attempt to grasp the meaning. Under some conditions, we may even attempt to
identify individual letters of words to figure them out, if the other words in the sentence
do not provide their meaning for us. This is especially true if we are unnaturally
forced to isolate words in a sentence, or have our attention directed at individual
words, as is sometimes done in laboratory tests of reading ability. Under these
confusing or contrived situations, we may make use of other ways to identify a word,
such as making use of sound-letter clues.
Putting Context Studies in Context

A series of studies have been conducted over the past three decades on the role of
“context” in reading, some of which are thought to contradict the Goodman-Smith
view presented here (or at least as I have interpreted their view). Unfortunately, little
agreement exists on what context is or how it should be defined, especially when it
comes to interpreting the results of these experimental studies. As I’ve noted, Smith
(1973, 1994) defined context broadly as primarily nonvisual background knowledge,
the reader’s “understanding at any given point in the text of what it is about,
enhanced by what has already been seen behind and sometimes ahead of where the eye
happens to fall.” This knowledge of language, including syntax, semantics,
sound-letter correspondences, genre, and style, “facilitates reading by reducing the reader’s
uncertainty” (1994, p. 275).

A somewhat narrower view of context is provided by Stanovich (1982, 1986),
who defines context as what can be determined from the visual information present
in the sentence or passage. Context, then, is what is “predictable” or congruous with
the printed text in question. Thus, in the sentence, “The man drove his ___,” the
context predicts a word such as car, and to use context in this situation would be to say
“car” if so prompted.

Like several other researchers, Stanovich further distinguishes between two
“levels” of processing involved in reading: word identification and comprehension.
Word identification is a “lower level” process, which involves the identification of
individual words on the page. Comprehension is a “higher level” process, which is,
in a sense, putting the words we’ve identified together to form the idea or message of
the text. Context, Stanovich claims, does help “higher” level comprehension
processes, but this effect has been “inappropriately generalized to the word recognition
level” (Stanovich & Stanovich, 1995, p. 90) by theorists such as Smith. Stanovich
concludes that word identification is not dependent primarily upon context, but takes
place for the most part independently of it. Words are first recognized and then
passed along to the “higher level” processor for comprehension. For this reason,
“background knowledge and contextual information attenuate as the efficiency of
word recognition processes increases” (Stanovich & Stanovich, 1995, p. 91); that is,
good readers use the visual information on the page, the “context,” less than poor
readers. According to this view, we use context merely to compensate for a lack of
decoding skills to identify words (Stanovich, 1986). Therefore context use (the use
of the visual information on the page that shows what is predictable and congruous
with the given text) is more typical of poor readers than of good ones.


Some Arguments on Context Use 

Even if we accept Stanovich’s definition of context, the evidence used to support his 
claims is not, I think, very convincing, and in many cases is ambiguous. The most 
common type of evidence on context use comes from studies employing a measure 
that we might call the “missing word” test. In this measure, the last word of the sen- 
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tence is deleted and then shown to the reader after a brief pause. The reader is also 
shown words in a “neutral” sentence (“The next word will be_”), where the sen¬ 

tence gives no clue as to the meaning of the missing word. The time it takes the reader 
to identify a word in the normal sentence minus the time it takes her to identify a 
word in the “neutral” sentence is considered to represent how much “context” facili¬ 
tated the speed of reading words. Several studies using this type of test have found 
that the difference in how long it takes readers to identify words between the normal 
and neutral sentences is greater for poor readers than for good readers (e.g., 
Schwantes, Boesl, 3c Ritz, 1980; Stanovich, 1982,1986; West 3c Stanovich, 1978). In 
other words, poor readers seem to benefit more from having words in “context” 
(again, in Stanovich’s sense) than good readers. 
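
The arithmetic behind the “missing word” measure can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the latencies are invented, the function name is hypothetical, and the subtraction is written as neutral minus normal so that a positive score means context saved time. None of the numbers come from the studies cited.

```python
# Hypothetical illustration of the "missing word" facilitation score.
# All latencies (in milliseconds) are invented for illustration only.

def facilitation(neutral_ms: float, context_ms: float) -> float:
    """Time saved by context: neutral latency minus normal-sentence latency."""
    return neutral_ms - context_ms

# Invented mean word-identification latencies for two groups of readers.
readers = {
    "good": {"neutral": 600.0, "context": 570.0},
    "poor": {"neutral": 900.0, "context": 800.0},
}

scores = {
    group: facilitation(t["neutral"], t["context"])
    for group, t in readers.items()
}

for group, saved in scores.items():
    print(f"{group} readers: {saved:.0f} ms faster with context")
```

With these made-up figures the poor readers show the larger score, which is the pattern the studies report. Note that the score depends on the neutral baseline as much as on the normal-sentence latency.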

There are plausible alternative explanations of these results, equally consistent 
with the data presented by Stanovich and his colleagues and with the Goodman- 
Smith Model, as Krashen (1998a) notes. The condition of having to identify a
specific word in a sentence places the reader in an unnatural condition of focusing on
identifying words rather than doing what he normally does, which is identify
meaning. This shift in focus may have consequences for the strategies that will be adopted
by the reader. In particular, the use of neutral-context sentences means that all
readers will need to draw on something other than context to identify individual words.
In other words, the task becomes very much like individual word identification, in
which the ability to use other cues for meaning, such as sound-letter
correspondences, will now be at a premium. Since better readers are also better than poor
readers at decoding words in isolation, we should therefore expect that good
readers will perform relatively faster in the neutral sentence condition compared to poor
readers. The difference between the context and neutral conditions may be smaller 
for good readers than for poor readers because good readers can both employ their 
superior decoding ability in the neutral context and use context more effectively. 
This is entirely in line with the Goodman-Smith Model and the results found by 
Stanovich and others. 

Thus, two variables are being tapped in the calculation of “context” in the
missing word test. Stanovich (1986) was aware of this confound due to decoding ability
in his context calculations and attempted to control for it by controlling for the level 
of decoding ability in good and poor readers via a longitudinal study. Stanovich, 
Cunningham, and Feeman (1984) examined a group of good and poor readers in the 
fall and spring of their first-grade year. The children were given two paragraphs to 
read: a coherent paragraph written at the first-grade level, and a “random” paragraph 
where those same words were listed in random order. The coherent paragraph
reading was thought to measure context use, and the random list task was thought to tap
decoding ability. The researchers compared the performance of the good readers in 
the fall with the poor readers in the spring, so that the two groups were roughly equal 
on the test of decoding. 

Unfortunately, the way in which Stanovich and his colleagues measured
context in this study rendered the results, like those of previous investigations,
ambiguous on the question of poor versus good readers’ use of context. Rather than use the
missing word test, Stanovich et al. (1984) measured context by comparing the
number of words read correctly per second on the coherent versus random paragraph. As
in previous studies, they found that the poor readers seemed to benefit more from 
the meaningful sentence context than the good readers, since the difference between 
the coherent and random conditions was greater for the poor readers than the good 
ones. However, under the category of “words read correctly,” the researchers counted 
only the words read by the children that appeared in the text. Unlike other studies 
(e.g., Goodman, 1965, 1996), any word read aloud that was not exactly the same as
that found in the text, but made sense according to the context of the passage, was
counted as wrong. This scoring system therefore may well have underestimated the
full effects of context. Interestingly enough, the good readers actually did make more
errors than the poor readers at equivalent levels of decoding ability (14.8 vs. 4.6,
effect size = .8), although we don’t know whether the errors were contextually
appropriate or not.³ The results from this study do not show, then, that poor readers use
more context than good readers, even in the narrow sense defined by Stanovich. 

Another problem underlying many of the context studies to date is that readers 
will not adopt strategies that will make optimal use of context when they are forced 
to confront a mixture of context and no-context sentences, as noted by Massaro and 
Sanocki (1993). They will, in a sense, “have their guard up” during the experimental 
conditions, and the “context” conditions will not truly measure what context does in 
normal, fluent reading. Stanovich and West (1983) attempted to answer this
objection in part by “blocking” the experimental sentences used so that readers would get
“mostly” context conditions (66 percent of the time) or mostly noncontext (no or 
incongruous) conditions (again, 66 percent of the time). But this may not solve the 
problem, since the occurrence of incongruous words in one-third of the sentences 
will alert the reader that a context strategy will not be particularly successful. 

Studies in which readers are in fact allowed to use context show quite clearly 
that readers make heavy use of context. Sanocki and Ogden (1984) and Norris (1987) 
both found that when readers are placed in a situation where the experimental 
stimuli contain mostly congruous sentence conditions (that is, a typical reading
situation), they clearly take advantage of context facilitation, even in the rather
unnatural laboratory conditions of having to accurately identify or make a decision about a
single word at the end of a sentence (see also Sanocki et al., 1985).⁴

Is Text Really Predictable? 

Another argument used against the Goodman-Smith Model is that words are not, in 
fact, highly “predictable” in text, and hence context is not particularly helpful in
reducing uncertainty. Citing several studies showing that adult readers can correctly
predict words in a text only 20 to 35 percent of the time (e.g., Finn, 1977-1978),
Stanovich (1993) claims that these figures alone contradict a model of fluent
reading where context plays an important role in word identification: Our “guesses”
from context would be too inaccurate to be of much use. However, this same
evidence can also be taken to mean that natural text contains an enormous amount of
information that reduces uncertainty of upcoming words by as much as a third.
More importantly, Stanovich seems to have incorrectly interpreted Smith’s position
on what is meant by “predictable.” Smith does not claim that readers guess or
accurately predict exactly what the next word of a sentence will be, only that the natural
redundancy of text allows for the number of unlikely alternatives to be reduced. Thus,
a 20 to 35 percent chance of getting the word exactly correct underestimates the true
nature of the prediction. The predictability probabilities in Finn (1977-1978) are
calculated based upon how well readers were able to guess the next word accurately
in a given sentence during experimental conditions, not upon the amount of
uncertainty reduced by context. Perfetti, Goldman, and Hogaboam (1979) report that their
fifth-grade subjects (Experiment 3) gave a response to a deleted word in a passage 
that was either the exact word or contextually appropriate 58 percent of the time, 
demonstrating that text can provide a good deal of information in helping the reader 
eliminate unlikely alternatives. 

A Word on Eye Movement Studies 

How context is used in fluent reading has often been related to studies on how and 
where the eyes fixate when we read. It has been claimed that such studies
demonstrate that we do not “skip” words or letters when reading, but rather, that we
examine each and every unit of print available to us, contrary to what Smith is said to
assert (e.g., Stanovich, 1993). The principal problem with this interpretation is that
the “Skipping Hypothesis” does not appear to have been Smith’s position to begin
with. As noted earlier, Smith (1973, 1994) stated quite clearly that we make a
“feature analysis” of letters and words as we reduce uncertainty and grasp meaning from
written language, although it remains unclear precisely which features we use.
“Sampling” text on the page does not require, then, that our eyes literally jump over words.
Indeed, Smith conjectures that it does not seem unreasonable that we benefit from
receiving a “smooth inflow of selected visual information” as we go about the
business of comprehension (1994, p. 255). It is equally important to point out that we
can’t be said to identify individual letters or words one by one merely because we scan 
across them. Identification of individual words, if it takes place at all, may happen 
after the eye has already moved on to the next fixation. 

There are several methodological problems in the way in which most eye
fixation research is conducted, as Krashen (1998a) notes. Readers are placed in highly
unnatural positions, with their heads often constricted by chin rests, bite plates, and 
the like. In addition, readers are instructed to pay close attention to what they are 
reading, since they will be tested on it when they finish, and they are forced to read 
rather uninteresting texts, leading to a tendency to fixate more carefully than might 
otherwise be the case. 

Method Comparisons: Which Way Is Best? 

A final point of contention in the debates over what constitutes fluent reading and 
how children learn to read is the subject of method comparison studies, studies that 
attempt to determine which approach (if any) is best in teaching children to read. It 
has been claimed, for example, that classroom studies show that a heavy emphasis 
on metalinguistic instruction produces superior results in early reading achievement,
and thus investments in programs to train students in phonemic awareness and 
phonics will produce substantial dividends. None of the major research reviews to 
date, however, have been very useful in helping us answer the question of which 
method is best, and widely cited comparisons of teaching methods often suffer from 
ill-defined labels or inconclusive results. 

Chall’s “Great Debate” Revisited

Easily the most influential review of reading methods in the past thirty years is 
Jeanne Chall’s Learning To Read: The Great Debate (1967/1983), an impressive
synthesis of research on the teaching of reading. Unfortunately, Chall’s 1967 review now
has little methodological relevance to current debates over “whole language” versus
“phonics” in early reading, yet it is still cited as “proving” that phonics and
metalinguistic training is a superior approach for teaching reading. Chall compared
what she called “meaning-emphasis” versus “code-emphasis” methods, but many of
the “meaning-emphasis” methods examined had little in common with our
current understanding of how children make sense of texts as outlined in Chapters 2
and 3. One “meaning-emphasis” method in particular, “Look-Say” or “whole word,”
was defined by what it wasn’t: teaching children explicitly about letter-sound
correspondences (“phonics”). This meant that even rather meaningless activities, such
as the Look-Say strategy of memorizing “sight words” in isolation, were counted as
“meaning-emphasis” as long as there was no explicit phonics teaching. The other
“meaning-emphasis” method in Chall’s book, “intrinsic phonics,” was said to
combine some teaching of letter-sound correspondences in the context of other (equally
undefined) instruction. The “code-emphasis” approach analyzed in the book was
“systematic phonics,” where children were taught about sounds and letters in a
systematic way not necessarily related to what they were trying to read, or often before
they were exposed to real reading at all. 

The uncertainty about what actually went on in the classrooms compared in 
Chall’s review should be reason enough for casting doubt on any conclusions drawn
from it. In particular, it must be stressed again that the “meaning-emphasis”
approaches in Chall bear little or no relation to current materials and methods of
teaching reading, so citation of Chall’s reviews does nothing to bolster the case for
present-day advocates of metalinguistic instruction. What is perhaps more
interesting (and surprising, given the influence of the first edition of her book) is the
weakness of Chall’s original evidence in favor of explicit, direct, and systematic phonics
instruction, even compared to the ill-defined “Look-Say” and “intrinsic phonics”
methods. The results summarized in Chapter 4 of her book reveal that deciding on a
superior method based on the studies she reviews amounts to little more than a coin
toss. Table 6.1 summarizes a “vote count” of the silent and oral reading
comprehension measures used in Grades One and Two.⁵

No method is clearly favored in the first and second grades when it comes to 
reading comprehension, and none of the studies Chall reviewed that included 
phonics/no phonics comparisons extended beyond the second grade. In a separate 
Table 6.1

Vote Count of Findings from Grades 1 and 2 Studies Reviewed by Chall (1967)

STUDIES FAVORING
No Phonics (“Look-Say”)            4
Some Phonics                       2
Intensive, Systematic Phonics      4
No Difference                      1

(from Chall, 1967/1983, Tables 4-1A & 4-1B)


comparison (Chall’s Table 4-2B), the differences between “intrinsic” versus
“systematic” phonics were found to be negligible on silent and oral reading measures.
The systematic phonics was superior in 8 findings, the smaller dosage of phonics 
(intrinsic) was better in 4 cases, and no difference was found in another 4 studies. 
Superior results in only half of the studies is not particularly strong evidence for a 
pro-phonics position. 
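
The “half of the studies” figure follows directly from the three counts just given. A minimal Python sketch of the tally, using only the counts reported in the text (the variable names are illustrative):

```python
# Vote counts as reported in the text for the "intrinsic" versus
# "systematic" phonics comparison.
votes = {
    "systematic phonics": 8,
    "intrinsic phonics": 4,
    "no difference": 4,
}

total = sum(votes.values())
shares = {outcome: count / total for outcome, count in votes.items()}

for outcome, share in shares.items():
    print(f"{outcome}: {votes[outcome]} of {total} ({share:.0%})")
```

Systematic phonics comes out ahead in 8 of 16 findings, so the other half either favor the smaller dose of phonics or show no difference.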

Reviews of “Whole Language”

More recent attempts to summarize the research literature on method comparisons 
have been equally difficult to interpret, largely for the same reasons that plague 
Chall’s review: vague labels, poor study quality, and limited information about what
actually went on in the classes studied. Stahl and Miller (1989) compared a
combination of “whole language” (WL) and “language experience approach” (LEA)
studies to methods that focused more on teaching “skills” and using skill-based “basal”
reading materials. As in the case of Chall’s “Look-Say,” however, the categories used
in the Stahl and Miller review, whole language and language experience approach,
in reality covered a multitude of methods. It is not clear what exactly was being
compared in many studies, or even whether WL and LEA can be combined under a single
heading (McGee & Lomax, 1990; Schickedanz, 1990). 

The language experience approach involves, in essence, having children dictate 
stories to the teacher and read the product, although other elements are included in 
some programs. “Whole language” was defined in Stahl and Miller’s (1989) review 
as having the following characteristics: (1) There is an emphasis on using children’s 
own language, (2) the class is “child-centered” rather than “teacher-centered,” (3) the 
use of real story books is promoted over commercial basal readers, and (4) phonics 
is only taught in the context of meaningful reading, if at all. It should be noted that a 
great deal could go on under these descriptors, none of which specifies that children 
will necessarily have more contact with comprehensible print. Unfortunately, Stahl 
and Miller found that, even if specific program descriptions were included, “few 
studies used observations to verify fidelity to the intended method” (p. 92). Given the 
heterogeneity of the studies used for both comparison groups, it is not surprising 
Table 6.2

Amount Spent on Various Reading Activities in Basal and LEA Classrooms
(in Minutes/Day) and Relationship to Reading Achievement

                                              LANGUAGE      CORRELATION
                                              EXPERIENCE    WITH
                                BASAL         APPROACH      READING
READING ACTIVITY                CLASSROOMS    CLASSROOMS    COMPREHENSION
Reading Basal                   48.1 (13.1)    5.9 (8.1)       .47*
LEA Chart Stories               11.3 (8.0)    24.9 (5.9)      -.13
Sight Word Teaching             14.3 (6.5)    15.5 (5.9)       .24
Phonics                         14.6 (4.7)    14.8 (3.4)       .16
Drawing Pictures with Stories    8.4 (6.1)    18.6 (7.4)      -.31*
“Other” Reading Activities       7.4 (5.7)    22.5 (12.3)     -.09

(from Harris & Serwer, 1966, Tables 5 and 6.)
* = statistically significant, alpha = .05


that Stahl and Miller found the overall differences between WL/LEA and basal/skill 
approaches to be virtually nonexistent (effect size = .09, favoring whole language/ 
language experience approach). 

Some of the studies included in Stahl and Miller’s review illustrate that the label 
placed on a program is less important than what actually takes place in the classroom. 
This is particularly true of some of the LEA studies or of those investigations where it 
was not clear in the description how much actual real reading students did. Harris and 
Serwer (1966)—one of a series of United States Department of Education-sponsored 
early reading studies included in both Stahl and Miller’s (1989) review as well as in the 
second edition of Chall’s (1967/1983) book—discovered that the children in basal/ 
skills groups actually did more reading than the LEA classes they studied. Table 6.2 
shows the amount of time spent by teachers in basal/skills and LEA classrooms and 
the standard deviations, an indication of variability within a given method. 

Note that the two groups did about the same amount of phonics instruction
(about 15 minutes), but the basal group spent nearly twice as much time actually 
reading either a commercial basal or LEA chart stories, 59.4 minutes versus 30.8 for 
the LEA group. For all teachers combined, the amount of time spent on the basal 
reader was positively correlated to reading comprehension (r = .47). This does not 
mean, of course, that we should return to the often stilted “Dick-and-Jane”-style 
basal readers. Interesting and well-written story books can also be read by children. 
It may mean, however, that providing lots of opportunities to read comprehensible 
texts may be a more important criterion for comparing classroom instruction than 
“basal” or “LEA” designations. 
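
The reading-time totals quoted above can be checked against Table 6.2 by adding the two text-reading rows for each group. A short Python sketch, using only the means reported in that table (the dictionary layout and names are illustrative):

```python
# Mean minutes per day, as reported in Table 6.2 (Harris & Serwer, 1966).
minutes = {
    "basal": {"Reading Basal": 48.1, "LEA Chart Stories": 11.3},
    "LEA": {"Reading Basal": 5.9, "LEA Chart Stories": 24.9},
}

# Total time spent reading connected text (basal reader plus chart stories).
reading_time = {
    group: round(sum(activities.values()), 1)
    for group, activities in minutes.items()
}

ratio = reading_time["basal"] / reading_time["LEA"]

for group, total in reading_time.items():
    print(f"{group}: {total} minutes/day")
print(f"basal-to-LEA ratio: {ratio:.2f}")
```

The totals are 59.4 and 30.8 minutes, a ratio of about 1.9, which is where “nearly twice as much” comes from.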

Since Stahl and Miller (1989) note that few of their studies took such careful 
approaches to measuring how programs were actually implemented, it is difficult to 
Table 6.3

Percentage of Time Spent on Reading Activities in LEA and Traditional Classrooms
(Teacher-Led Group)

                                                         CORRELATION WITH
                                                         READING
ACTIVITIES               TRADITIONAL     LEA             COMPREHENSION
Sight Word Practice       5.33 (3.15)    8.37 (9.32)        -.04
Word Analysis            13.38 (6.69)    5.37 (1.84)         .14
Oral Reading of Text      4.32 (2.59)    7.42 (8.38)         .09
Silent Reading of Text    6.72 (4.69)    6.91 (8.39)         .53

(from Evans & Carr, 1985, Tables 1 and 6) Percentage of time is out of one-half day of
instruction.


make generalizations on how inaccurate labeling in other studies may have affected 
their results. An analysis of instructional time by Evans and Carr (1985), another 
study included in the Stahl and Miller review, is not encouraging on this point,
however. Like Harris and Serwer (1966), Evans and Carr found that “language-oriented”
LEA groups differed from “routinized performance” traditional classrooms in time 
spent on some activities but not others, and not always in the expected direction. 
Table 6.3 displays some of these differences. 

The most important thing to note in Table 6.3 is that the LEA group did not 
spend considerably more time on reading than the traditionally taught classes,
particularly on teacher-led silent reading, which had the largest single correlation with the
reading comprehension test (.53). The traditional group spent considerably more time
on phonics activities (word analysis), and the LEA group more on sight-word practice.

Stahl and his colleagues (Stahl, McKenna, & Pagnucco, 1994) reviewed studies
that were published after the Stahl and Miller (1989) meta-analysis and reported that
whole language approaches did better in four studies, skills in one study, and no
difference was found in 12 others. However, it is unclear from their report which
studies were classified under which category. In addition, since many of the included
studies were unpublished, it is very difficult to determine whether “whole language” 
really meant more reading than the traditionally taught groups. We are thus left with 
the same problems of imprecise labels that plague previous reviews. 

When “Whole Language” Means Real Reading 

It does little good to attempt to judge the adequacy of teaching techniques merely by 
various labels that are attached to them, as the evidence above suggests. In fact, it is 
probably best to avoid such labels altogether, in that they are open to interpretation, 
and focus instead on the actual description of events in the classroom. We
need to examine studies in which an approach, however labeled (“whole language,” 
“literature-based”), is clearly defined and/or includes measures of activities where 
Table 6.4

Reading Comprehension Results for Literature-Based and Control Classrooms

                                EXPERIMENTAL   EXPERIMENTAL
MEASURE                              1              2          CONTROL
Probed Recall*                     23.13          21.22         14.59
CTBS Reading Comprehension*        62.62          62.61         59.54

* = posttest means adjusted for pretest scores
(from Morrow, 1992, Tables 3 and 9)


students are exposed to more meaningful print as compared to classrooms where less 
exposure is given or those where the focus is more on isolated skills. When this is 
done, we see that approaches heavy in print exposure are almost always superior to (and
no worse than) the alternatives. Some of these studies examine results in terms of
reading achievement, while others look at reading attitudes, habits, or related
behaviors (e.g., amount and quality of writing).

Reading Comprehension 

Several recent studies have found “print-rich” classrooms to produce superior results 
in reading comprehension when compared to more skills-oriented approaches.
Morrow (1992) undertook a large sample study (n = 166) in nine second-grade
classrooms located in ethnically and socioeconomically diverse schools. A clear difference
in the amount the experimental and control groups read and were read to is
indicated in the treatment descriptions provided. The two experimental groups both
received heavy doses of print exposure via access to a well-stocked classroom library
and teacher-guided reading activities that emphasized reading to children,
independent reading of books, and making and reading student-created books. In addition,
one experimental group participated in a “home-based” reading program, where 
parents were encouraged to read to their children and engage them in reading. The 
control group used the traditional basal reader program, along with some reading 
aloud to the children. Table 6.4 lists measures used by Morrow that relate specifically 
to reading comprehension. 

The experimental groups that received greater exposure to print had superior 
comprehension in the probed recall measure, which tapped both traditional
comprehension elements (detail, cause and effect, inference) as well as story structure (plot,
setting, theme). No difference was found on the comprehension subtest of the
Comprehensive Test of Basic Skills (CTBS).⁶

A large-scale study conducted by Knapp and associates (1995) examined 
school performance over a two-year period in a variety of areas in over 140 grade one 
through six classrooms in schools located in low-income communities. The
researchers classified their classrooms as having high, medium, and low “meaning emphasis”
Table 6.5

Meaning Emphasis in 66 Classrooms

                                HIGH-      MODERATE-   LOW-
                                MEANING    MEANING     MEANING
Minutes/Day Reading               48          18          5
Read/Write Integration (%)        68          36         10
Discuss Reading (%)               69          37         12
Deeper Understanding (%)          33          20         18
Skills in Context (1-3 scale)     2.3         1.8        1.6

(from Knapp et al., 1995, Figure 4-1)


in reading. A “meaning emphasis” included spending time on reading texts,
discussing texts, focusing on “deeper understanding,” integrating reading and writing, and
teaching skills within the context of reading rather than in isolation (pp. 66-67).
Results from 66 of the 140 classrooms in the first year of the study revealed some
dramatic differences in emphasis, as noted in Table 6.5. Students in the “high-meaning”
classrooms spent nearly ten times as many minutes reading as the “low-meaning” 
classes, and devoted a substantially greater percentage of their day to talking about 
their reading, connecting reading and writing activities, and engaging in more 
thoughtful discussion of texts. 
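
The size of these contrasts can be made concrete by dividing each high-meaning figure in Table 6.5 by its low-meaning counterpart. The Python sketch below uses only the values reported in that table. The ratio is simply an illustrative way of summarizing them, not a statistic reported by Knapp et al.

```python
# (high-meaning, low-meaning) values as reported in Table 6.5.
table_6_5 = {
    "Minutes/Day Reading": (48, 5),
    "Read/Write Integration (%)": (68, 10),
    "Discuss Reading (%)": (69, 12),
    "Deeper Understanding (%)": (33, 18),
}

# How many times larger the high-meaning figure is than the low-meaning one.
ratios = {measure: high / low for measure, (high, low) in table_6_5.items()}

for measure, ratio in ratios.items():
    print(f"{measure}: {ratio:.1f} times the low-meaning figure")
```

Minutes of reading per day shows the widest gap (48 versus 5, a ratio of 9.6), which is the “nearly ten times” figure in the text.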

The results in terms of reading comprehension (CTBS test) among all 140 
classrooms were analyzed first by comparing those students in only the high- and 
low-meaning groups (about a third in each category). Table 6.6 summarizes the
results, reported in terms of Normal Curve Equivalent (NCE) scores. Note these are
not simple gain scores, but posttest score differences adjusted for initial pretest
performance and the students’ poverty level. The number of students tested in the
nine-month cycle was much greater (more than a thousand) than in the twelve-month
measures (around 450).

Meaning-emphasis classrooms did significantly better in year one (grades 1, 3,
and 5), and better (but not significantly so) in year two (grades 2, 4, and 6) when 
measured over a single academic year. The twelve-month gains were less impressive, 
with no significant differences reported in the smaller sample of students used. Still, 
it is important to keep in mind the unusually large number of schools, classrooms, 
and students compared in Knapp et al.’s analysis, such that the generally positive
direction of the results is still quite impressive.

Freppon (1991) investigated the abilities of 24 first graders randomly selected 
from two skills-based and two literature-based classrooms. While the study is only 
quasi-experimental in design in that Freppon examined existing classrooms, her
results are indicative of the impact that immersion in meaningful written language can
have. Observations were made in each classroom to measure the amount and type 
of interactions that took place, and her data indicate that the literature-based classrooms

Table 6.6

NCE Reading Score Differences of High- and Low-Meaning Emphasis Classrooms

                              9 MONTHS            12 MONTHS
                              (FALL TO SPRING)    (FALL TO FALL)
Grades 1, 3, & 5 (Year 1)        +5.6                -0.5
Grades 2, 4, & 6 (Year 2)        +1.4                +3.3

+ = favors meaning emphasis
(from Knapp et al., 1995, Figures 7.1 & 7.2)

did indeed engage children in more meaningful print in terms of the amount
of time spent reading (18-20 minutes versus 5-9 minutes in the skills classrooms) 
and reading to the children (6-10 minutes versus none). Freppon measured the 
children’s ability to comprehend passages via an “altered passage” test, where children 
were asked if a certain passage made sense and if not, why not, as well as an oral
reading measure. The literature-based students rejected the nonsensical passages at
significantly higher rates than the students in the skills classroom and were much more
likely to comment on how it was not storylike or didn’t make sense. The two groups 
were equally accurate in their oral reading. Ironically, the literature-based students 
were much more successful in actually sounding out words correctly in the stories as 
compared to the children in the skills group (53 percent to 35 percent), although the 
latter had received more instruction in sound-letter correspondences.

Positive results have been found in other studies where students clearly
engaged in more reading and had greater access to print than comparison skills groups.
Hagerty, Hiebert, and Owens (1989) studied children across three grades (2, 4, and
6) in 12 literature- and skills-based classrooms. Students in the literature-based
classes outperformed those in the skills classes on the reading comprehension
portion of the standardized test, controlling for pretest differences. Eldredge (1991)
similarly found that a “modified whole language” approach produced better reading 
comprehension results than a traditional basal method for 56 first-grade students 
from schools serving low socioeconomic communities. Klesius, Griffith, and 
Zielonka (1991) did not find significant differences in reading comprehension
between their “whole language” and traditionally taught first-grade classrooms. It is
clear, though, that the teachers involved were slow to implement the changes in the
curriculum, and the likely result was that the children in the whole language
classrooms did not in fact experience a significantly greater amount of exposure to text.

Reading Attitudes and Habits 

As in the case of reading comprehension, students in classrooms with a print- 
exposure focus have been found to be superior in terms of their attitudes toward 
reading and the amount of reading they do. These factors should not be taken lightly: 
It does little good to teach children to read if, when they leave school, they hate
reading. Morrow’s (1992) study discussed above also included a measure of the amount
of reading done by the participating students. She found that the literature-based 
classes not only read better than the control groups, but also reported reading more 
books and magazines. The children were also able to name more books and authors, 
confirming the fact that they had more exposure to print. Eldredge (1991) also found 
that his modified whole language groups had significantly more positive attitudes 
toward reading as a result of their exposure to more reading. The researcher-created 
attitude measure had 100 items; whole language students averaged 79.54 (standard 
deviation = 19.3), the basal students, 71.59 (standard deviation = 20.3). 
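
To put this attitude difference on the same footing as the effect sizes quoted elsewhere in the chapter, one can compute a standardized mean difference from the reported means and standard deviations. The Python sketch below is an illustration only: the source does not report an effect size for this measure, and the calculation assumes equal group sizes, so the two standard deviations are pooled with equal weight. The function name is hypothetical.

```python
import math

def standardized_difference(mean_a: float, sd_a: float,
                            mean_b: float, sd_b: float) -> float:
    """Mean difference divided by the pooled SD (ASSUMES equal group sizes)."""
    pooled_sd = math.sqrt((sd_a ** 2 + sd_b ** 2) / 2)
    return (mean_a - mean_b) / pooled_sd

# Means and standard deviations as reported for the attitude measure.
d = standardized_difference(79.54, 19.3, 71.59, 20.3)
print(f"standardized difference: {d:.2f}")
```

Under that equal-size assumption the difference works out to roughly four-tenths of a standard deviation in favor of the whole language group.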

McKenna, Stratton, Grindler, and Jenkins (1995) conducted three studies on 
the effect of different approaches to early reading instruction on reading attitudes. 
The first two studies found no differences between students in schools said to be
engaging in “whole language” and traditional instruction, but no actual teacher
practices were measured or observed. A third study did look at the impact of individual
“whole language” teachers on reading attitude more closely, this time focusing on the 
specific environment of successful and less successful teachers in terms of producing 
positive reading attitudes among students. Here clear differences were noted: The 
more successful teachers in terms of promoting higher reading motivation among 
students had more print on display and books accessible for students. 

Gambrell (1996) reports on a large-scale 10-week project, “Running Start,” to 
increase reading motivation among elementary school students. Classrooms were 
provided with approximately 50 to 60 new books, and additional home and school 
reading was encouraged. One study with 550 first-grade students found that the 
print-access program was indeed more successful in increasing students’ motivation 
to read than traditional approaches. Students in the Running Start program were 
more motivated, took more books home to read, and spent more time reading. These 
positive effects appeared to be long-lasting: A selection of students who participated 
in the project were interviewed six months later, and the significant differences 
among students in motivation and reading habits persisted. 7 Finally, Mervar and 
Hiebert (1989) studied the amount that students actually read at school and home 
in two different instructional approaches for 20 second-grade children. The students 
in literature-based classrooms spent considerably more time choosing books in the 
school library (mean of 11.6 minutes versus 2.60 for the skills group) and were much 
more sophisticated in the way they chose their books (e.g., sampling the text, inspecting
more than one book, using the card catalog).

Other Literacy Measures 

Several of the studies cited above also included other measures of literacy performance
in comparing types of reading instruction. Morrow (1992) included several
measures of the written work by children in her literature-based and traditional basal
classrooms and found that the children in the experimental groups did significantly
better in several measures, including story structure, variety of vocabulary, and quantity
of writing. Children in Morrow’s literature-based classes were also superior in
the ability to retell stories in oral and written form.

Similarly, Purcell-Gates, McIntyre, and Freppon (1995) found that students in
kindergarten and first-grade classrooms where greater emphasis was placed on
reading to children and with children made greater gains in their use of literate language
than those in skills-emphasis classes. Large amounts of time were devoted to
read-alouds and giving children time to read self-selected books, while the skills
groups spent considerably less time engaged in reading. The results of children’s
written language samples revealed significant differences in the frequency and
breadth of literate language. Hagerty et al. (1989), who noted a significant advantage
for literature-based classrooms in reading comprehension (described above),
found no differences on writing scores across the three grade levels they studied.

A small-scale study by Freppon (1995) found that a group of students (N = 17) 
who had attended whole language kindergarten and first-grade classrooms began to 
change their perceptions of reading when they attended a more skills-oriented class 
in grade two. No differences were found in a test of oral-reading accuracy, but other 
differences emerged between those who continued receiving whole language instruction
and those who did not. Children from the skills-oriented classes showed much
less knowledge about things to read outside of class than those from the literature-based
classes, wrote less in their journals, and produced less literate language in their writing.

An Exception to the Rule?: The Foorman Study

Children in classrooms where they are clearly exposed to more text typically do better (and
never worse) than those in classrooms with less focus on print exposure or ones in
which skills or phonics receive heavy emphasis. Despite the consistency of these findings,
one widely publicized study has recently been reported to show that a heavier
use of metalinguistic instruction produced better results than the alternative approaches.
While the final version of the study has not yet been published, a preliminary
version was made available by the principal investigator in the study, Barbara
Foorman (Foorman, Francis, Beeler, Winikates, & Fletcher, in press). Unfortunately,
no strong conclusions can be drawn about the data as they appear in this version of 
the study. Missing is detailed information on what actually took place in the “whole 
language” and “phonics” treatments for the children involved; that is, it is not clear 
whether the “whole language” group actually had more exposure to print than the 
“phonics” group. Preliminary results found no differences between the two groups 
on a combined reading-comprehension and word-identification test, and both 
groups scored very low on a Formal Reading Inventory comprehension assessment. 


Conclusion: No Phonics? 

Previous reviews of reading instruction methods have used vague labels to define
approaches, attempted to equate approaches that differ in significant ways, and generally
failed to define very carefully the amount of actual print exposure children receive
in class. All of these problems leave open to question the conclusions of these
efforts. When we restrict ourselves to an (admittedly) much smaller group of studies
in which the differences between groups in terms of the amount of connected reading
of text are more clearly established, then students who read more overall do better on a
variety of literacy measures. I should stress again that this does not mean that some
metalinguistic instruction, for some children, may not be useful, which is consistent
with our discussion in previous chapters. However, the notion that an emphasis primarily
on skills and phonics instruction produces superior results to programs centered
on providing children with a lot of interesting and comprehensible texts is not
supported by the available evidence.



7 


Real Crises, Real Solutions


The Limits of Method 

After having taught in, observed in, and written about schools for more than a decade,
John Holt made the following observation more than twenty-five years ago:

Many ... people are demanding better reading programs in their schools. They 
might be wiser to try to get more branch libraries in their districts, or better yet, 
neighborhood storefront libraries or traveling bookmobiles, with newspapers, periodicals,
and paperbacks—the kinds of reading material we know kids like to read.
What’s the point of having kids learn to read if after they’ve learned there’s nothing
to read... Access to reading matter, not reading methods, is the name of the game.
(Holt, 1972/1993, pp. 205-206; emphasis added) 

Warwick Elley, director of the International Association for the Evaluation of Educational
Achievement’s (IEA) 1992 international study of reading in 32 countries involving
more than 200,000 students, came to this conclusion in 1994, after having
examined the results of that massive undertaking:

Acceptable levels of literacy are achieved by most pupils, in most systems, despite a diversity
of reading methods and traditions. In general, however, achievement is greatest
when the educational systems are well endowed financially, when teachers are
well educated, when students have ready access to good books, when they enjoy
reading and do it often, and when their first language is the same as that of the language
of the school. (Elley, 1994, pp. xxi-xxii; emphasis added)

The similarity of these conclusions, one reached after years of careful observation, the 
other the product of sophisticated statistical calculation, is striking. The availability of 
books to read—and the subsequent amount of reading done—appears to be as critical, 
and certainly no less so, in determining success in reading as classroom instructional 
methods. This is not to say that teaching methods are unimportant, nor that access to
print is the exclusive or sufficient condition for literacy. We have seen that different
methods can have very distinct outcomes, and individual children may need different
types of assistance. But without access to print, teaching methodology can have only a
limited impact and may not, in any case, be capable of explaining significant variations
in reading achievement observed between school districts, states, and countries.

Table 7.1

Correlations Between Reading Method and State NAEP Score

PERCENTAGE OF          CORRELATION WITH     CONTROLLED
TEACHERS USING:        NAEP SCORE           FOR POVERTY

Whole Language          .07                  .006
Phonics                -.59                 -.07

(from NCES, 1994)

An illustration of this point comes from scores on the National Assessment of Educational
Progress (NAEP), the reading test given to a nationally representative
sample of United States schoolchildren every two years. Table 7.1 shows the correlations
between the NAEP scores and the percentage of teachers in a state who say they
used “whole language” teaching and those who favored “phonics” approaches. Data
are drawn from the 1992 fourth-grade scores (National Center for Education Statistics,
1994). The second column in Table 7.1 shows the simple correlation between reported
reading method and a state’s NAEP score. For whole language, the figure is a
nonsignificant .07, close to zero. For phonics, it is negative and significant, at -.59.
However, it is also true that phonics is used more heavily in poor states than in
wealthy ones, so that to distinguish the possibly negative effects of poverty in general
and phonics use in particular, it is necessary to control for poverty (here represented
by a composite of the percentage of children in a state living in poverty
[Bureau of the Census, 1995] and the percentage of schools in the state with more
than 75 percent of their students enrolled in free lunch programs). As can be seen in
the table, the “whole language” correlation becomes even closer to zero, and the
phonics relationship is similarly nonsignificant and minuscule at -.07.
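
One common way to "control for" a third variable such as poverty is the first-order partial correlation. The sketch below is illustrative only: the function name is hypothetical, and the two poverty correlations are assumed values chosen to show the mechanics (the phonics-poverty correlation is not reported here, and the exact procedure behind Table 7.1 is not described). It is not meant to reproduce the -.07 figure.

```python
import math


def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation between x and y with the linear effect of z removed."""
    numerator = r_xy - r_xz * r_yz
    denominator = math.sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))
    return numerator / denominator


r_phonics_naep = -0.59     # simple correlation reported in Table 7.1
r_phonics_poverty = 0.70   # ASSUMED for illustration; not given in the text
r_poverty_naep = -0.77     # ASSUMED for illustration

adjusted = partial_correlation(r_phonics_naep, r_phonics_poverty, r_poverty_naep)
print(f"Simple r = {r_phonics_naep:.2f}, partial r = {adjusted:.2f}")
```

When the method variable and the outcome are both strongly related to poverty, as assumed here, most of the simple correlation disappears once poverty is held constant.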

Unfortunately, these “self-reports” of teaching method are not always accurate,
and the labels “whole language” and “phonics” may simply be too vague to be of
much use. But the NAEP data are the best we have on practices in United
States classrooms, and the failure to find any relationship between method and
achievement suggests that, from a policy standpoint, we may not wish to limit our
focus to teaching method as the sole or primary factor in explaining why reading
scores differ from one place to another, particularly when it comes to large-scale differences
such as those observed among schools and states. It is conceivable that
variations in the supply of reading materials may overwhelm differences in teaching
methods, given that teachers and students are limited by the resources available to
them. Let us now examine the effect that variations in access have in predicting reading
success and failure.
Figure 7.1

The Relationship between Print Access and Reading 



Evidence on the Importance of Access 

In our model of literacy acquisition outlined in Chapter 2, it was suggested that literacy
development is dependent on the amount of reading done which, in turn, is
affected by the amount of reading material available to the reader (Krashen, 1993). 
This relationship is shown in Figure 7.1. 

Reading instruction can intervene in the model, of course, by making reading 
more comprehensible in a variety of ways and thus increasing the amount of reading
done. I begin with the evidence for the second part of the model, more reading
means better reading, since it is fairly well agreed upon among researchers along all 
points of the methodological spectrum. 

More Reading Leads to Better Reading Achievement 

A variety of investigations of the relationship between the amount of reading and 
reading performance have been done, from simple surveys to experimental studies. 
The most recent NAEP test results provide a straightforward illustration of the connection
between reading frequency and achievement, as shown in Table 7.2: Students
who read more, read better. 

More sophisticated research methods have found similar results. Anderson et 
al. (1988) had fifth-grade students keep a diary of all of their out-of-school activities 
over a period of several weeks. Table 7.3 shows the percentile rank on a standardized 
achievement test, the amount of reading in minutes (including books, comics, newspapers,
and magazines), and the estimated number of words read per year. Note that
the more reading students did, the higher their reading score. 

Table 7.2

Fourth Graders’ Self-Reported Reading Frequency and 1994 NAEP Score

ALMOST EVERY DAY     1-2 TIMES A WEEK     NEVER/HARDLY EVER

221                  217                  198

(from NCES, 1994)


Table 7.3

Amount of Time Spent Reading and Reading Achievement
of Fifth Graders (N = 155)

PERCENTILE     MINUTES OF TEXT      ESTIMATED NUMBER OF
RANK           READING PER DAY      WORDS READ PER YEAR

98             90.7                 4,733,000
90             40.4                 2,357,000
70             21.7                 1,168,000
50             12.9                   601,000
20              3.1                   134,000
10              1.6                    51,000

(from Anderson et al., 1988; Table 3)


The range of time and amount of reading is quite stunning: from 1.5 hours to
less than 2 minutes per day, and from close to 5,000,000 words to around 50,000
words read per year. Not surprisingly, Anderson and his colleagues found that the
amount of time spent reading was the best predictor of reading achievement. To test
whether the amount of outside reading was really responsible for reading growth
over the period of the study, Anderson and his colleagues controlled for the effects
of the students’ previous test scores (from grade two). The result was a positive correlation
between reading achievement and both book reading (.29) and comic book
reading (.19). Taylor et al. (1990) similarly found in their study using reading logs to
measure time spent reading that the amount of silent reading done in school was a
significant predictor of growth in reading comprehension among fifth and sixth
graders, after having controlled for previous reading ability.

An alternative to reading logs for measuring the amount of reading done was 
developed using signal-detection “recognition checklists,” where subjects indicate 
whether or not they recognize a certain author or title. Real choices are mixed in with 
false selections to correct for guessing. The idea is that those who recognize more 
authors or titles have, in fact, read more (West, Stanovich, & Mitchell, 1993). Moderate
to strong positive correlations between these measures of print exposure and
literacy indicators have been found at all age levels, and across languages. A summary 
of the results of a selection of these studies is shown in Table 7.4. Those who read 
more have better reading comprehension, better vocabulary, and better writing skills. 
These relationships, while expressed in simple correlation form in Table 7.4, have 
been found to survive multiple regression analysis when several other related variables
are entered into the equation, including other forms of school achievement.
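
The logic of a guessing-corrected checklist can be sketched in a few lines. This is an illustration under stated assumptions, not the scoring procedure of any particular study: it uses one simple rule (proportion of real items checked minus proportion of false items checked), and the item names and the function name are hypothetical.

```python
def checklist_score(checked, real_items, foils):
    """Proportion of real items recognized minus proportion of foils checked."""
    checked = set(checked)
    hit_rate = len(checked & real_items) / len(real_items)
    false_alarm_rate = len(checked & foils) / len(foils)
    return hit_rate - false_alarm_rate


# Hypothetical checklist: four real authors mixed with two invented names.
REAL_AUTHORS = {"Real Author A", "Real Author B", "Real Author C", "Real Author D"}
FOILS = {"Invented Name X", "Invented Name Y"}

careful_reader = ["Real Author A", "Real Author B", "Real Author C"]
guesser = ["Real Author A", "Real Author B", "Real Author C", "Invented Name X"]

print(checklist_score(careful_reader, REAL_AUTHORS, FOILS))
print(checklist_score(guesser, REAL_AUTHORS, FOILS))
```

Both respondents recognize the same three real authors, but the one who also checks an invented name receives a lower score, which is how the false selections discourage indiscriminate checking.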

Table 7.4

Recognition Checklist Studies: Print Exposure and Literacy Development

STUDY                     LANGUAGE/AGE          PRINT EXPOSURE       LITERACY MEASURE
                                                MEASURE              & CORRELATION

Cipielewski and           English/3rd and       Book Titles          Read. Comp.: .49
Stanovich (1992)          5th graders                                             .58

Stanovich and             English/college       Author Names         Read. Comp.: .54
Cunningham (1992)                               Magazine Titles                   .48

McBride-Chang             English/5th-9th       Book Titles          Read. Comp.: .64, .32*
et al. (1993)             graders

West et al. (1993)        English/adults        Author Names         Vocabulary: .62
                                                Magazine Titles                  .48

Stanovich and             English/college       Author Names         Read. Comp.: .44
Cunningham (1993)                               Newspaper Titles                  .34

Stanovich, West,          English/college       Author Names         Vocabulary: .63, .60
and Harrison (1995)       & older adults        Magazine Titles                  .51, .41**
                          (>66 years old)

Lee, Krashen, and         English/adults        Author Names         Vocabulary: .55
Tse (1997)

Rodrigo, McQuillan,       Spanish/adults        Author Names         Vocabulary: .75
and Krashen (1996)

Lee and Krashen           Chinese/high          Author Names         Writing: .22
(1996)                    schoolers

* = .62 is for what McBride-Chang et al. classified as “nondisabled” readers, .32 for
“disabled” readers

** = First correlation is for older adults, second for college students


The correlational relationship between amount of reading and reading
achievement has been confirmed in a large body of experimental research. Krashen
(1993) reviewed 41 studies on “sustained silent reading” (SSR) programs, where students
are given a certain amount of time each day (usually 10-20 minutes) to read a
book for pleasure. These students typically do as well as or better than those who engage
in traditional classroom instruction. When given a long enough opportunity to
operate, SSR programs almost always show superior results to programs with less
exposure to print. These impressive experimental results hold for second-language
learners as well. In the United States, programs designed for “heritage language”
speakers, those who come from families where a language other than English
is spoken, have shown that more reading leads to more literacy development at a variety
of age levels (McQuillan, in press: c). Similar results have been found in a number
of large-scale experimental studies involving more than 10,000 elementary and
middle school students studying English as a foreign language in several different
countries and across a number of first languages. Children supplied with comprehensible
texts outperform those with less reading exposure in all categories of language
proficiency, including reading comprehension, writing, grammatical accuracy,
and even listening comprehension (Elley, 1991, 1997).

More Access Leads to More Reading 

The connection between the amount of reading done and reading proficiency has 
been well known and accepted for a number of years. Less well known but of equal 
importance has been the finding that more access to reading materials leads to more 
reading, and subsequently higher reading achievement, and can itself explain a great 
deal of variation in reading scores. As noted in the previous chapter, Morrow (1992) 
found that increasing the access to books for second-grade classes in classroom literacy
centers (libraries) and teacher-guided literature activities led to more reading
by the experimental groups and higher levels of reading achievement by the end of 
her study. Table 7.5 compares the pre- and posttest survey responses of students who 
had increased library access and those who did not. Note that while the experimental
groups increased their amount of reading dramatically over the course of the
treatment, the control group saw only minimal changes. 

Table 7.5

The Effects of Increased Print Access on Reading Habits

                     EXPERIMENTAL 1      EXPERIMENTAL 2      CONTROL

Activity             Pre      Post       Pre      Post       Pre      Post

Read a book          12.6     18.4       13.6     20.5       12.1     13.6
Read a magazine       2.8     14.2        3.6     16.2        3.2      5.6

(from Morrow, 1992)


Halle, Kurtz-Costes, and Mahoney (1997) also found that print access had a
powerful effect on reading achievement. The researchers examined parental attitudes
and behaviors among a group of African American families (n = 41 children) with
third- and fourth-grade children attending an inner-city school. Halle and her colleagues
took a number of different behavioral and attitudinal measures before the
school year began and correlated them with reading achievement scores 9 months
later. Interestingly, the parental “instruction” variables, such as the frequency with
which parents spoke to their children about certain topics, told their children how
to pronounce words correctly, taught them the names of countries and states, and so
forth, were not correlated with test scores. The only behavior measure on the part of
parents that correlated significantly with reading scores was the number of books in
the home. This was true even after controlling for the level of the mother’s education
and the children’s reading scores a year earlier. This is, as the researchers point out, a
rather conservative estimate of the effects of book access, since the reading level of
the children in the previous year is highly correlated with current reading achievement,
and the mother’s level of education and number of books in the home are also positively
correlated. The correlation found, then, between books at home and reading
scores (r = .29) would almost certainly be much stronger had these controls not been
in place, making their findings all the more striking.

Not surprisingly, Halle and colleagues (1997) concluded that providing reading
materials to low-income African American students may be one of the most important
things schools can do. They point out that while providing books in the home
may appear to be within the parents’ control, it is in fact “limited by the amount of
time, money [and] skills available to provide these supportive resources” (p. 534), and
that “within a community that has severely limited financial resources... the responsibility
for creating a literacy-enhanced environment need not be shouldered entirely
by individual families ... Because many of these families cannot afford to purchase
children’s books, it becomes all the more important to make community resources ... easily
and readily available within disadvantaged communities” (p. 535, emphasis added).

Similar results connecting print access to more reading have been obtained at 
the elementary and junior high school levels in other studies. In an innovative experiment,
Dorrell and Carroll (1981) put noncirculating comic books in the library,
then measured the average daily circulation of noncomic materials. The results were 
remarkable: Circulation due to increased use of the library jumped 30 percent, demonstrating
that merely increasing access to books increases the likelihood that they
will be read. Ramos and Krashen (in press) report an equally dramatic result with 
elementary school students who previously had little access to books. After they were 
taken to the local public library, the amount of reading and enthusiasm for reading 
increased dramatically among students. Rucker (1982) reports that a random sample 
of junior high school students who were given two free subscriptions to magazines 
related to their interests for one year scored significantly higher on standardized tests 
of reading compared to a control group. Neither the students nor the teachers knew 
of the experiment, but the power of print access was substantial all the same. 

The relationship between access to reading matter and reading frequency is 
also demonstrated in a set of case studies by Worthy (1996), who looked at eleven 
“reluctant” readers in middle school, conducting observations of and interviews
with each student. Worthy found that the primary reason for students’ infrequent
reading was not that they dislike reading per se “but because [they do] not
have access at home to the kinds of reading materials that interest [them]” (p. 485).
Access was one of three barriers to reading Worthy identified among her informants 
(the other two were a lack of choice in the books assigned to them and the absence 
of opportunities in school to read for pleasure). The students noted that they had 
little access at home to the kinds of reading materials they enjoyed, and the school 
library was unable to fill that gap. 

Larger-scale correlational studies provide supporting evidence of the positive 
effects of print access on reading frequency and achievement. Lance, Wellborn, and 
Hamilton-Pennell (1993) found at the state level that the access to books in school 
via the library was the most powerful predictor of academic achievement among several
variables analyzed, controlling for socioeconomic status. (At two of the six grade
levels that Lance et al. analyzed, school library quality was actually a more powerful
predictor of reading scores than socioeconomic status measures.) Krashen (1995)
found a similar relationship for elementary school students when examining data at 
the national level in the United States. Controlling for per pupil spending per state 
and computer software holdings, Krashen found that the quality of the school library 
was a significant predictor of fourth-grade NAEP reading scores. 

International comparisons show the same strong effects of print access on
reading achievement. Elley’s (1994, 1996) comparison of reading score, home environment,
and school and public library data from several countries came to the conclusion
that access to print was the most powerful correlate of reading achievement.
Controlling for differences in economic conditions among the participating countries,
Elley (1996) found that the size of the school library was the number one factor
distinguishing the reading scores of nine-year-olds between the high and low
scoring nations, with an impressive effect size of .82. Frequent silent reading time was
the next most important variable, with an effect size of .78.

Print Access, Free Reading, and NAEP Reading Scores: Ranking the States

My own analysis of the 1992 state NAEP reading scores provides yet another indicator
of how important print access is in determining reading achievement. By measuring
the amount of print access and the frequency of reading by state, we can see
how state rankings on these measures compare to their performance on reading. 2 Table 7.6 shows a list
of variables drawn from the NAEP surveys and other federal sources on school and
home resources related to print access and reading achievement. Print access is
defined here as the combination of the three likely sources of reading matter for children:
the home, the school library, and the public library (McQuillan, 1997a). In addition,
measures of the amount of reading and the level of poverty present in a state
were also included. Table 7.7 shows the rankings of the states in Free Reading, Total
Print Access (home, school, and community), and NAEP reading scores.

An inspection of the scores in Table 7.7 will reveal the strong relationship between
how much print access a state has, how frequently students read, and reading
achievement. We note in Table 7.8 that more print access is correlated positively with
free reading (r = .720), suggesting that the better access children in a state have to
reading materials at home, in school, and in the community, the more they read.
Print access and free reading are also closely related to test scores: reading proficiency
correlates .852 with Total Print Access, and .644 with Free Reading. This means that
simply knowing how much print access students have in a state can explain 73 percent
of the variance in reading test scores; knowing how much free reading takes
place accounts for 41 percent of the variance.
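
The step from a correlation to "variance explained" is simply squaring the coefficient. A short sketch of that arithmetic, using the two correlations reported in Table 7.8 (the variable names are illustrative):

```python
# Correlations with NAEP score rank, as reported in Table 7.8.
correlations = {
    "Total Print Access": 0.852,
    "Free Reading": 0.644,
}

# The share of variance accounted for is the square of the correlation.
variance_explained = {name: r ** 2 for name, r in correlations.items()}

for name, share in variance_explained.items():
    print(f"{name}: r = {correlations[name]:.3f}, r squared = {share:.1%}")
```

Rounded to whole percentages, the squared values correspond to the 73 percent and 41 percent figures cited above.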

One possible criticism of this analysis is that print access is merely a proxy for 
larger socioeconomic influences on reading achievement. When we calculate the cor¬ 
relation between poverty (defined in Table 7.6) and a state’s NAEP score, the rela¬ 
tionship is indeed as strong as the one between print access and reading achievement, 
in the opposite direction (r = -.77). However, even after we control for the effects of 
poverty statistically, we see that the effects of print access on reading scores remains 



REAL CRISES, REAL SOLUTIONS • 75 


Table 7.6 

List of Variables Used in Analysis 

Poverty 

1. School Lunch: The percentage of schools in the state with more than 75 
percent of their students enrolled in free lunch programs. 

2. Children in Poverty: The percentage of children five- to seventeen-years-old 
in the state who live in families with income below the federal poverty line 
(Bureau of the Census, 1995). 

Total Print Access 

Home 

3. Books at Home: The percentage of families having more than 25 books in 
the home. 

4. Magazines: The percentage of families subscribing to at least 1 magazine 
regularly. 

5. Newspaper: The percentage of families subscribing to 1 newspaper. 

School 

6. School Library Books: The number of books per pupil in the school library 
(White, 1990). 

7. School Library Subscriptions: The number of magazine subscriptions per 
pupil in the school library (White, 1990). 

Community 

8. Public Library Books: The number of books per person in the public li¬ 
brary (NCES, 1995). 

9. Public Library Subscriptions: The number of magazine subscriptions per 
person in the public library (NCES, 1995). 

Free Reading 

10. Books Read: The percentage of students who read at least 1 book for their 
own enjoyment in the last month. 

11. Frequency: The percentage of students who read for fun on their own time 
at least once a month. 

12. Sustained Silent Reading (SSR): The percentage of students who are pro¬ 
vided time for reading books of their own choosing in school at least once a 
week (as reported by the student). 

Reading Comprehension 

13. NAEP Score: The average state NAEP fourth-grade reading score. 


Note: All data taken from NCES (1994) unless indicated above. Average Print Access Ranks 
were derived by taking the average of the ranks of Public Library Quality, School Library 
Quality, and Print at Home. An analysis using standardized scores to combine the three 
measures produced nearly identical results. 
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Table 7.7 

Ranking of States on Print Access (by NAEP Score Rank) 


STATE 

NAEP 

READING 

RANK 

AVERAGE 

PRINT 

ACCESS 

RANK 

FREE 

READING 

RANK 

NAEP 

SCORE 

(RAW) 

New Hampshire 

1 

7 

5 

229 

Maine 

2 

10 

22 

228 

Iowa 

4 

4 

1 

227 

Massachusetts 

4 

13 

5 

227 

North Dakota 

4 

2 

9 

227 

Wisconsin 

6 

4 

5 

225 

New Jersey 

8 

9 

35 

224 

Wyoming 

8 

5 

20 

224 

Connecticut 

9 

6 

20 

223 

Indiana 

16 

11 

27 

222 

Minnesota 

16 

8 

2 

222 

Nebraska 

16 

1 

7 

222 

Pennsylvania 

16 

17 

7 

222 

Utah 

16 

18 

13 

222 

Virginia 

16 

15 

13 

222 

Idaho 

17 

22 

23 

221 

Missouri 

17 

14 

13 

221 

Oklahoma 

17 

23 

33 

221 

Ohio 

19 

20 

22 

219 

Colorado 

21 

21 

9 

218 

Rhode Island 

21 

25 

13 

218 

Michigan 

23 

16 

14 

217 

West Virginia 

23 

32 

39 

217 

New York 

24 

12 

20 

216 

Delaware 

26 

31 

33 

214 

Kentucky 

26 

38 

41 

214 

Texas 

26 

39 

33 

214 

Georgia 

29 

36 

20 

213 

North Carolina 

29 

26 

25 

213 
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Table 7.7 (con’t) 




AVERAGE 




NAEP 

PRINT 

FREE 

NAEP 


READING 

ACCESS 

READING 

SCORE 

STATE 

RANK 

RANK 

RANK 

(RAW) 

Tennessee 

29 

37 

28 

213 

Arkansas 

32 

29 

37 

212 

Maryland 

32 

19 

24 

212 

New Mexico 

32 

34 

36 

212 

South Carolina 

34 

30 

33 

211 

Arizona 

35 

27 

33 

210 

Florida 

36 

33 

35 

209 

Alabama 

37 

41 

41 

208 

Louisiana 

38 

28 

38 

205 

Hawaii 

39 

25 

20 

204 

California 

40 

40 

20 

203 

Mississippi 

41 

42 

42 

200 

District of Columbia 42 

36 

27 

189 


Print access and free reading ranks are derived from the average of individual ranks com¬ 
puted as noted in Table 7.6. Standardized scores produced nearly identical results as those 
shown here. The averaged ranks were used in the correlational and multiple regression 
analyses. Again, the use of standardized scores produced virtually the same outcomes. 


Table 7.8 

Spearman Rank Order Correlations for Print Access, Free Reading, 
and NAEP Scores 

NAEP SCORE RANK FREE READING RANK 

Free Reading Rank 0.644 — 

Average Print Access Rank 0.852 0.720 
Table 7.9

Regression Analysis of Print Access and Poverty on NAEP Test Scores

VARIABLE        BETA       STANDARD ERROR    T        P VALUE

Constant        188.852    9.26              20.39    .000
Poverty         -.480      .094              -5.07    .000
Print Access    1.12       .260              4.30     .000

R² = .72, Adjusted R² = .71, F = 51.22




Table 7.10

Regression Analysis of Free Reading and Poverty on NAEP Test Scores

VARIABLE        BETA       STANDARD ERROR    T        P VALUE

Constant        167.16     40.55             4.12     .000
Poverty         -.615      .112              -5.48    .000
Free Reading    .68        .452              1.51     .140

R² = .61, Adjusted R² = .59, F = 31.23


strong and positive (r = .63), indicating that print access makes a powerful contribution
to determining reading achievement in the states independent of socioeconomic factors.

Attempting to combine the effects of both print access and free reading statistically
in a single analysis is more difficult with this particular set of data, since the
variables are highly correlated, resulting in a fair degree of multicollinearity (Berry
& Feldman, 1985; McQuillan, 1997a). This means that the variables are so interrelated
that trying to tease out their specific, independent effects is problematic. A
separate multiple regression analysis on the effects of poverty and print access on test
scores was conducted, however. The regression analysis allows us to see if, taking into
account both the effects of poverty and print access at the same time, each variable
has a significant impact on test scores. As can be seen in Table 7.9, both variables do
indeed make a distinct contribution to test scores (significant effects are indicated
when the value of P is less than .05). Poverty is negatively related to reading scores,
while print access has a significant positive influence on reading.
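
For readers who want to see where the T column of Table 7.9 comes from, the following minimal sketch divides each reported coefficient by its standard error. It assumes the usual definition of the t ratio for a regression coefficient; the variable names are illustrative. Small differences from the printed T values are to be expected, since the published coefficients and standard errors are themselves rounded.

```python
# Coefficients and standard errors as reported in Table 7.9.
table_7_9 = {
    "Constant": (188.852, 9.26),
    "Poverty": (-0.480, 0.094),
    "Print Access": (1.12, 0.260),
}

def t_ratio(coefficient, standard_error):
    """t statistic for the null hypothesis that the coefficient is zero."""
    return coefficient / standard_error

t_values = {name: t_ratio(b, se) for name, (b, se) in table_7_9.items()}

for name, t in t_values.items():
    print(f"{name}: t = {t:.2f}")
```

The larger the absolute value of the ratio, the smaller the corresponding P value, which is why both poverty and print access register as significant in this analysis.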

Table 7.11

Intercorrelations among the Variables

                                                        PUBLIC     TEACHER-
                  BOOKS    EXPENDITURES    SOFTWARE     LIBRARY    PUPIL

SAT-Verbal (a)    .29*     -.13            .01          .34*       -.11
Books                      -.06            -.10         .10        -.37**
Expenditure                                -.04         .17        -.20
Software                                                .09        .04
Public Library                                                     .01

a = Partial correlations controlling for percent of students taking the SAT by state
* = p < .05, ** = p < .01


A similar analysis of poverty and free reading on reading scores (Table 7.10)
shows, however, that free reading falls short of having a significant effect separate
from poverty (p = .14). The free reading measure may fail to show an impact due to
the restricted range in the measure itself: The mean score for free reading was 87.26
for the states (out of 100), with a standard deviation of only 2.27. There may not have
been enough variation in the amount of free reading by state to produce an impact
on test scores. Other state-level studies (Krashen, 1995) and analyses done with
individual students (versus states, as in this case) have shown the effects of free
reading separate from socioeconomic factors (Ortiz, 1986).
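
The restricted-range explanation offered for the free reading measure can be illustrated with a small simulation. The sketch below is built entirely on hypothetical data (it does not use the state figures): an outcome depends linearly on a predictor plus noise, and the correlation is computed once over the full range of the predictor and once over a narrow band of it. The seed, noise level, and band limits are arbitrary assumptions chosen only to make the point visible.

```python
import random
from math import sqrt

def pearson(x, y):
    """Pearson product-moment correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

rng = random.Random(0)  # fixed seed so the run is repeatable

# Hypothetical data: a predictor spanning 0.0 to 99.9 and a noisy linear outcome.
x = [i / 10 for i in range(1000)]
y = [xi + rng.gauss(0, 10) for xi in x]

full_r = pearson(x, y)

# Keep only cases in a narrow band of the predictor, mimicking a measure
# whose observed values barely vary from one case to the next.
band = [(xi, yi) for xi, yi in zip(x, y) if 83 <= xi < 91]
narrow_r = pearson([p[0] for p in band], [p[1] for p in band])

print(f"full range:       r = {full_r:.2f}")
print(f"restricted range: r = {narrow_r:.2f}")
```

The underlying relationship is identical in both cases; only the spread of the predictor differs. A weak or nonsignificant coefficient under a restricted range therefore says little about whether the variable matters.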

Print Access and Reading Achievement in High School:
State SAT Verbal Scores

The powerful effects of print access extend beyond the elementary school level. A
separate analysis (McQuillan, 1996a) of state SAT scores and library quality also
shows the significant relationship between the availability of reading materials and
reading comprehension. Using data from all fifty states and the District of Columbia
(from Morgan, 1990; Podolsky, 1991; Snyder & Hoffman, 1990; and White, 1990), I
calculated the correlation coefficients among the following variables: public library
circulation per capita, school library holdings (books per pupil), teacher-pupil ratio,
per pupil funding, computer software holdings, and an adjusted average score of the
SAT Verbal scores for a state. 3

Table 7.11 provides the intercorrelations for all the variables. Three relationships
were significant: SAT scores were positively correlated with both the number
of books per student in the school library and the per capita public library circulation,
while the number of books per pupil was negatively correlated to the teacher-pupil
ratio. It appears that states that have fewer library holdings also have more
crowded classrooms. No other significant correlations were observed.
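
Table 7.11 reports partial correlations that control for the percentage of students taking the SAT in each state. A minimal sketch of the standard first-order partial correlation formula follows. The three input correlations are hypothetical values chosen for illustration (the zero-order correlations are not reported in this chapter), and the function name is an assumption.

```python
from math import sqrt

def partial_correlation(r_xy, r_xz, r_yz):
    """First-order partial correlation of x and y, controlling for z."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

# Hypothetical zero-order correlations, for illustration only:
#   x = state SAT verbal score
#   y = school library books per pupil
#   z = percent of students taking the SAT
r = partial_correlation(r_xy=0.10, r_xz=-0.80, r_yz=0.10)
print(round(r, 3))
```

With these invented inputs, a weak raw correlation between scores and books becomes noticeably stronger once participation rates are held constant. This is the reason for adjusting state SAT averages before comparing them.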

Table 7.12 shows the results of the multiple regression analysis, using the SAT
Verbal scores (weighted for the percentage of students participating) as the dependent
variable, with all predictor variables entered simultaneously. Not surprisingly, a
measure of print access (public library circulation) was the only significant predictor
of reading scores, with another (school library quality) falling just short of
significance (p = .10). Once again, access shows itself to be an important predictor of
reading achievement, even when controlling for such factors as per pupil spending,
classroom size, and amount of computer software available. 4
Table 7.12

Multiple Regression Analysis

PREDICTOR          B          BETA      STANDARD ERROR    T        P

Books              1.1500     .2339     .6946             1.65     .10
Expenditures       -.0032     -.1705    .0025             -1.23    .22
Software           .0338      .0732     .0622             .54      .58
Public Library     4.18       .3227     1.7664            2.36     .02
Teacher-Student    -1.0376    -.1070    1.3555            -.75     .45

R² = .21, adjusted R² = .12, F = 2.44, p < .05.
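
The summary line of Table 7.12 can be roughly checked from R² alone. The sketch below applies the standard formulas for adjusted R² and the overall F statistic, assuming that all 51 jurisdictions (fifty states plus the District of Columbia) and all five predictors entered the regression; that sample size is an inference from the text, not a figure reported with the table. Because the published R² is rounded to two decimals, the recomputed F only approximates the printed value.

```python
def adjusted_r2(r2, n, k):
    """Adjusted R-squared for n observations and k predictors."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

def f_statistic(r2, n, k):
    """Overall F statistic for a regression with k predictors."""
    return (r2 / k) / ((1 - r2) / (n - k - 1))

n = 51     # assumed: fifty states plus the District of Columbia
k = 5      # predictors listed in Table 7.12
r2 = 0.21  # R-squared as reported

adj = adjusted_r2(r2, n, k)
f = f_statistic(r2, n, k)
print(f"adjusted R^2 = {adj:.2f}, F = {f:.2f}")
```

The gap between R² and adjusted R² is large here because five predictors are being fit to only about fifty observations.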





The Problem: Access Varies Enormously 

Given the importance of print access to reading and reading achievement, it is
only appropriate now to ask: How does access vary among students? The answer is
that there are wide, sometimes dramatic disparities in print access among individual
students, schools, and communities, and that these disparities result in large
variations in achievement. Elley’s (1994) study of reading achievement among more than
200,000 nine- and fourteen-year-old students in thirty-two countries, discussed
earlier, found that the number of books available at home and in the school library
varied significantly among students. Among the participating countries, the average
number of books in the home varied from 25 to 174, and books per student in the
school library from as few as 0.3 to as many as 28.9. In the United States, the mean
number of books in the home was 137, but the standard deviation of 94 indicates
considerable differences across the sample population. Similarly, the mean number of
books per student in the school library in the United States was 19, but the standard
deviation was 13.7, again indicative of huge disparities among schools. Data from
large, nationally representative samples collected by the United States Department of
Education corroborate these findings. The National Assessment of Educational
Progress report (NCES, 1994), for example, found that the percentage of fourth
graders reporting more than four types of reading materials in the home ranged
from 25 percent to 46 percent among participating states.

Several smaller sample studies have found similar disparities in print access
among children of varying ages at home, in school, and in the community. Halle and
her colleagues (Halle et al., 1997) found considerable variability in the number of books
present in the homes of African American children attending the inner-city school
they studied, from a low of 4 to a high of 200 books per child, with a mean of 37.5
and a median of 29. Feitelson & Goldstein (1986) discovered large disparities between
families (n = 102) from middle-class and lower-income backgrounds in schools where
children were judged to be succeeding and doing poorly, respectively. The researchers
visited each of the homes and counted the number of age-appropriate books. They
discovered vast differences in print access at home: a total of 2,774
books for the 51 middle-class families (mean: 54.4) versus only 243 total for the poorly
performing students (mean: 4.6), less than one-tenth as many books for the low-income
families. The authors note that the differences are even greater than the mean and total
figures indicate, as 61 percent of the low-income families had no books at all at home
for their children. In addition, they observed that some of the books shown them and
included in their count in the low-income families were not age appropriate (such as
encyclopedia-type volumes kept in the family parlor) and were clearly not being used by
even the older siblings in the household. In contrast, the middle-class homes, with the
economic resources to afford such items, had books that showed considerable wear and tear
and were easily accessible in the children’s bedrooms.


Table 7.13

Book Access and Library Usage Among Lower- and Middle-Income Families

                           LOW-INCOME       MIDDLE-INCOME

Number of Books Owned      13.75 (13.70)    35.40 (35.81)
Visits to Library/Month    3.72 (6.60)      9.30 (14.32)

(from Raz & Bryant, 1990, Table 5, p. 219.)

Raz and Bryant (1990) also found definite differences in print access among
middle-class and lower-income groups of children (n = 80) even before they began
to read independently (ages four and five). Table 7.13 shows the differences they
found. Note the large standard deviations indicated in the parentheses, showing that,
although the two groups differed greatly on average (with three times as many books
in middle- versus lower-income homes), considerable variation also was found
within income groups. In another survey of preschool children’s homes, Phillips and
McNaughton (1990) found considerable variability in children’s books in the homes
of three- and four-year-olds among middle-class families in New Zealand. The average
number of books found in the homes was 450, an astounding 300 of which were
children’s books, with a range of 50 to 500 per home.

One of the few studies to look systematically at home, school, and community
print access for a single group of students was Smith, Constantino, and Krashen
(1996). Smith et al. interviewed 40 students in low-income and high-income
communities in the Los Angeles area. Children living in two economically depressed
communities had respective averages of 0.4 and 2.67 age-appropriate books in the
home; those living in the high-income district had 199.2. The situation was similarly
unequal at school and in the community: School libraries in the high-income
community had around three times as many books per student, and public libraries had
twice as many books per person.

LeMoine, O’Brian, Brandlin, and McQuillan (1997) found that disparities in
access at home and in the community continue to exist within schools themselves.
After surveying library access policies within and among school districts in southern
California, the researchers found that high-achieving schools serving largely
middle-class children allowed their students greater access to books, more time to read
in school, and more liberal check-out policies than lower-achieving schools
in largely low-income neighborhoods. The results are summarized in Table 7.14.


Table 7.14

Print Access in Urban and Suburban Schools

                                                                 CAN MAKE
                      SILENT       1 VISIT       CAN TAKE        INDEPENDENT
                      READING      PER WEEK      BOOKS           VISITS TO
                      TIME         TO LIBRARY    HOME            LIBRARY

High-Achieving        80%          100%          73%             87%
(Urban) n = 15        (12/15)      (15/15)       (11/15)         (13/15)

High-Achieving        100%         100%          100%            86%
(Suburban) n = 8      (8/8)        (8/8)         (8/8)           (7/8)

Low-Achieving         53%          60%           47%             53%
(Urban) n = 15        (8/15)       (9/15)        (7/15)          (8/15)

(from LeMoine et al., 1997, Table 1)

Allington, Guice, Baker, Michaelson, and Li (1995) also surveyed school libraries
in 12 high- and low-income neighborhoods in New York and found that high-income
neighborhood schools had an average of 21.5 books per student, while
low-income schools had only 15.4 volumes. Disparities among schools were also
reported in magazine subscriptions, size of classroom libraries, and access policies.
Similar inequalities in print access have also been documented in languages other
than English at schools that serve a large number of language minority students
(Pucci, 1994; Pucci & Ulanoff, 1996). 5

A Print Wasteland: The Case of California 

Ranking nearly last in the country in reading test scores, California provides a good
case study of disparities in print access. While the state’s policymakers have decided
to focus their reform efforts on reading methodology (Stewart, 1996), there is good
reason to believe that California’s woes can be better traced to the abysmal state of
its school and public libraries. California students now live in what can best be
described as a print-barren desert and illustrate perfectly the situation John Holt
alludes to in the quote at the beginning of this chapter: learning how to read but
having nothing to read. The figures are sobering (McQuillan, in press: d):

California has some of the worst school libraries in the country. Since children get
anywhere from 30 to 90 percent of their free reading materials from the school
and public library (Krashen, 1993), the health of a state’s library system provides
a good barometer of how well its students will do in literacy achievement. According
to the most recent data available (White, 1990), California places last or nearly
last in almost every category, including books per pupil, per pupil spending, and
librarians per pupil. Table 7.15 shows just how far California is behind the national
averages in library quality, and the state’s rankings are telling: 45th in per-pupil
expenditures, 49th in books per pupil, and 50th in librarians per pupil.


Table 7.15

School Library Quality in California and the United States

                        ELEMENTARY    MIDDLE    HIGH SCHOOL

Books per Pupil
  U.S.                  18:1          16:1      15:1
  California            13:1          11:1      8:1

Per Pupil Spending
  U.S.                  15.44         15.50     19.22
  California            8.48          7.48      8.21

Librarians per Pupil    OVERALL
  U.S.                  895:1
  California            5,496:1

(from White, 1990; Snyder & Hoffman, 1995)

California has one of the worst public library systems in the United States. Similar
to its poor standing in school libraries, California also ranks near the bottom on
several measures of public library quality. Chute (1992) reports that in 1990-91
California ranked 39th in books and serials held per capita (1.95 versus 2.53
nationally), 43rd in capital outlay per 1,000 residents ($666.00 versus $1,912.00),
and 50th in hours open to the public per 1,000 residents (72.09 hours/week).
Not only does California rank relatively poorly now, but the situation has
become far worse since the late 1980s. In 1993, Gibson noted that book budgets in
the state had been cut 25 percent since 1989, the number of open hours had
decreased 30 percent since 1987, and per capita spending had been reduced 36
percent from 1989. Children’s services had been the hardest hit, Gibson found, with
25 percent of public libraries reporting cutbacks in this area (Gibson, 1993).

California’s children are poor and getting poorer. The only place children can find
books outside of the school and public library is at home. But if you’re poor, you’re
much less likely to be able to afford many books. California ranked 9th in the
country in the number of children five to seventeen years old living in poverty
(Bureau of the Census, 1995), with the poverty rate rising an incredible 25 percent
in California between 1989 and 1993 (Schmittroth, 1994). According to the most
recent figures, one out of every four California students is living in a family with
income below the poverty line. Data gathered in the 1992 NAEP assessment
confirm these figures: California ranked 10th highest among participating states in the
number of children participating in free lunch and nutrition programs. Not
surprisingly, the NAEP report also found that the state ranks near the bottom in the
percentage of homes with more than 25 books (NCES, 1994).
Education, Income, and the Problems
of Measurement: What Access Isn’t

As critical as understanding the influence of print access on reading frequency and
achievement is, it is also important to understand what doesn’t seem to influence
access, at least directly, and discuss some issues related to how access is measured. A
number of researchers, for example, have pointed out the influence of socioeconomic
status (SES) (income, parental education) on academic achievement in general
(White, 1983). Socioeconomic status and print access, while related, are not
identical: There are children who live in middle-class families who have poor access
to print, and those who live in low-income areas with a great deal of exposure to
literacy. In addition, as Heath (1982) and Taylor and Dorsey-Gaines (1983) have
pointed out, the literacy of families from diverse backgrounds is often different from
that displayed in school, and many families in low socioeconomic areas have a great
deal of reading and writing in their lives.

What seems instead to be the case is that children from low socioeconomic status
homes often have less physical access on average to the kind of print materials
related to the literacy most valued at school. As such, SES indicators serve as indirect
but imperfect proxies for print access. But it is not low SES per se that is a barrier to
literacy development, since those children who do have access to print and read
frequently are successful in school. If this is indeed the case, then we should find that
studies that provide both general measures of SES and specific measures of print
access find that the two concepts are separate, and that print access has more
predictive power in terms of reading proficiency.

Evidence from large-scale quantitative studies bears out this conclusion. Ortiz
(1986), in an analysis of a large national data set on nearly 100,000 United States
school children, found that students from low socioeconomic status families had
fewer reading materials at home and were exposed to less reading in the home
compared with students from more highly educated families, consistent with other
research (Elley, 1996; Morrow, 1983; NCES, 1994). However, Ortiz also found that
when level of SES was controlled for, the impact of reading materials in the home and
the extent of family reading was significant and substantial, suggesting that print in
the environment was the critical variable affecting reading acquisition, not poverty
or level of education.

White’s (1983) meta-analysis of almost 200 studies on socioeconomic status
and academic achievement provides evidence consistent with this hypothesized
relationship. When studies examined SES and academic achievement with the
individual student as the level of analysis, the strongest correlations with academic
achievement were found with “home atmosphere” (r = .58), rather than income
(r = .31) or education (r = .19). Studies which examined “home atmosphere” in White’s
analysis included in their definition a variety of variables, including reading
materials in the home. Similarly, Okagaki, Frensch, and Gordon (1995) found that in a
sample of low SES Latino youths, the high-achieving students came from
significantly more literate environments than their less successful peers, again indicating
that print access, not poverty itself, was one of the most critical determinants of
academic achievement. As noted above, however, those in poverty are more likely to
have more limited access to print than those who are not.

Rowe (1991, 1997) examined the effects of socioeconomic factors and reading
behavior on the reading achievement of 5,000 students ages five to fourteen. He found
that when home reading activities (reading alone, being read to, reading to others, and
discussing reading), attitudes toward reading, and attentiveness at school were
controlled for, socioeconomic status contributed very little to reading proficiency. This is
again consistent with the notion that in and of itself, SES is not the crucial factor in
reading achievement; rather, it is the actual amount of reading that counts. The most
important implication of these findings is that we can improve the reading
achievement of students in low SES schools by improving their access to print.

The “Try Harder” Solution: Mistaking Lack of Access 
for Lack of Effort 

Some have argued that improving resources for students who currently lack them
will not improve literacy, and that parental “values” or lack of effort are to blame for
poor reading performance of students. Mayer (1997), for example, claims that the
number of books a child has at home is only “weakly related” to parental income and
that, because they cost so little, whether children have them depends largely “on
parents’ tastes and values” (p. 113). This conclusion must be interpreted with
considerable caution.

Mayer’s sample of children from the National Longitudinal Survey of Youth
(NLSY) was restricted to four- and five-year-olds, most of whom were not likely to
be independent readers (Durkin, 1966; McQuillan, in press: b). This alone might
limit the generalizability of her findings for older students. More importantly, there
were clear ceiling effects in her measure of the number of children’s books in the
home. Parents were asked to indicate the number of books their child possessed,
from 1 to 10. Mayer’s results are shown in Table 7.16.

Table 7.16

Number of Books Owned by Children Ages 4-5 by Income Percentile

POOREST 10%    MIDDLE 20%    ALL PARENTS

7.3            9.5           8.816

Standard Deviation = 2.557
(Mayer, 1997, pp. 108 and 187)


The difference between poor and middle-income parents appears small, but
this is most likely due to the ceiling effect for the middle 20 percent: the mean was
9.5 out of a maximum of 10. We know from the studies reviewed above on variability
in access that 10 is probably well below the average number of books even in the
homes of low-income families. Halle et al. (1997) found an average of more than 37
books per child in their study, and Phillips and McNaughton (1990) noted that their
middle-class sample of parents had an average of 300 children’s books, or 30 times the
maximum on the NLSY. In short, the scale used by Mayer is much too restricted to
distinguish access differences among the sample population. The variability by
income level is thus very likely to be underestimated by her analysis, as is its subsequent
impact on the tests of verbal ability and word recognition reported. (Mayer nonetheless
found that, even with this imperfect measure, the effect of print access as part of a
larger index of “activities and possessions” in the home was still significant on the
Peabody Picture Vocabulary Test (PPVT) even after controlling for mother’s
education, verbal ability, sex, and race [p. 111].) Mayer’s additional claim that doubling the
income of parents will not lead to a substantial increase in the number of books
suffers from the same weakness as her percentile comparisons: A poor measure of books
was used.
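
The ceiling-effect argument can be made concrete with a short sketch. The book counts below are invented for illustration and are not drawn from Mayer (1997) or any study cited here; the only detail taken from the text is that the survey item topped out at 10. The sketch compares the gap between two groups as it really is with the gap that a capped scale would record.

```python
def mean(values):
    return sum(values) / len(values)

CAP = 10  # highest value a parent could report on the survey item

# Hypothetical numbers of children's books per home, invented for illustration.
low_income = [1, 2, 4, 6, 8, 10, 12, 15, 20, 30]
middle_income = [10, 20, 30, 40, 50, 60, 80, 100, 150, 300]

def capped(values, cap=CAP):
    """Record each count as the survey would: anything above the cap becomes the cap."""
    return [min(v, cap) for v in values]

true_gap = mean(middle_income) - mean(low_income)
observed_gap = mean(capped(middle_income)) - mean(capped(low_income))

print(f"gap in actual counts:    {true_gap:.1f} books")
print(f"gap on the capped scale: {observed_gap:.1f} books")
```

On the capped scale every middle-income home in this invented sample reports the maximum, so most of the real difference between the groups disappears from the data. Any correlation with income computed from such a measure will be understated for the same reason.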

Sowell (1997) interprets Mayer’s (1997) results to mean that, since “public
libraries are free... [T]he big question is whether parents choose to take or send their
children to such places” (p. C3), echoing Mayer’s conclusions that parents could, if
they wanted to, gain greater access to books for their children. But we have reviewed
evidence that print access in poorer communities and schools is substantially worse
than in middle-class communities, including public libraries (see also Di Loreto &
Tse, in press). Access is thus not as readily available to poor parents as
it is to higher-income families. The presence or absence of certain “values” or effort
is not sufficient for leveling the playing field among children, nor does it seem a
very persuasive explanation for reading success and failure. 6

Real Crises, Real Solutions 

There is now considerable evidence that the amount and quality of students’ access
to reading materials is substantively related to the amount of reading they engage in,
which in turn is the most important determinant of reading achievement. Many
students attend schools where the level of print access is abysmal, creating a true crisis
in reading performance. I do not wish to argue that simply providing books is all that
is needed for schools to succeed, what some have referred to as the “garden of
literacy” approach. Teaching is much more than physical resources, and no progress
can be made without qualified and sensitive teachers. But just as we would not ask a
doctor to heal without medicine, so we should not ask teachers and schools to teach
without the materials to do so. Reading material is basic to all education, and
providing a rich supply of reading matter to children of all ages, as well as a place and
time to read, is the first step to bridging the gap between poor and good readers. This
means that school libraries must be stocked with interesting and appealing materials
and appropriately staffed; students need to be given time to read silently books of
their own choosing (Krashen, 1993; Worthy, 1996); and states and communities need
to fund public libraries with reading material for people of all backgrounds and
interests. This seems to be a reasonable place to start in our efforts to provide all
students with equal opportunities to reach high levels of literacy.



Endnotes 


Endnotes for Chapter 1 

1. Considerable debate has occurred over the levels of United States math and science 
performance, but even the sharpest critics of what Stedman (1997) calls the “new perspective” 
on school performance agree that no major declines in reading achievement in the United 
States have occurred over the past three decades (Stedman, 1996; Coulson, 1996). Critics have 
instead shifted their arguments to other grounds, to say in effect that “our top students aren’t 
as good as they once were” (Coulson, 1996) or that our students have always performed at low 
levels (Stedman, 1996, 1997).

2. The NAEP report certainly encourages that interpretation. It proclaims that the cutoff 
scores “indicate if [students’] achievement meets expectations of what students should know
and should be able to do” (Campbell, Donahue et al., 1996, p. 5). 

3. This point was made to me by Gerald Coles (personal communication, 1997). See also 
Coles (1987). 

4. Indeed, B. Shaywitz et al. (1992) found that the “dyslexic” children classified by the 
percentile method differed significantly from a comparison sample of “normal” readers in 
their mother’s level of education (Table 3, p. 644). This suggests that environmental factors 
may be the cause of this supposed “dyslexia.” Shaywitz et al. argued, however, that, after I.Q. 
is controlled for, mother’s level of education had no impact on predicting reading scores. But 
mother’s level of education, to the extent that it reflects the print environment at home, itself 
influences a child’s I.Q., as Molfese, DiLalla, and Bunce (1997) demonstrate. Thus, I.Q. may 
act as a proxy for home environment, and controlling for it may attenuate the impact of the 
mother’s level of education on reading scores. More careful measures of home environment 
are needed to rule out what appear from other research to be likely environmental influences 
on both I.Q. and reading. 

5. Note that the whole problem of establishing “standards” and “minimal competencies” is 
not solved by, say, taking a “typical” fourth-grade curriculum and then asking if students can 
read the materials required. That only begs the question of how that “typical” curriculum was 
established in the first place (and by whom). And if you set your standard at the level that 
“most” fourth graders can handle, then we are back to square one, since we are no longer 
talking about what students should do, but what the average student can do, which is not a 
standard at all. 

6. The specific relationship between age and reading proficiency is very strongly affected by 
how much one continues to read. Stanovich and his colleagues (Stanovich, West, & Harrison,
1995) found that their sample of older adults (average age = 79.9 years) did not have 
diminished vocabulary knowledge (a correlate of reading proficiency) when the amount of 
reading they did was taken into account. In other words, older adults who were active readers 
did no worse on reading measures than college students who had similar levels of reading 
experience. M. C. Smith’s (1995) study of the NALS data showed that greater amounts of
reading are associated with higher reading proficiency at all age levels. Smith also found that
level of educational attainment predicts reading achievement, independent of current
reading habits (pp. 214-215).

Comparisons across generations are, however, tricky at best, as Rothstein (1997) warns. 
Changes in tests, retention practices, the general education level of parents, and dropout rates 
all make direct comparisons very difficult. After an analysis of three large-scale testing 
programs over the past several decades, Rothstein concludes that “if there was a golden age of 
education from which we have fallen, we have no way of knowing about it because we have 
no way of making meaningful comparisons between achievement then and now” (p. 25). 

7. Technically, the raw scores on commercial assessments such as the Iowa Test of Basic Skills 
are not directly comparable today to those of the 1950s, because the makers of the test 
“renorm” them every five to seven years. This means that the scale is readjusted so that half 
of the students score above the average and half below it. These average scores have generally 
been increasing each time the tests have been renormed over the last few decades, indicating 
a general rise in achievement for the student population on which they are calibrated. 

8. Readers of the technical reports on the United States participation in the IEA study may 
come across one potentially fatal flaw in the composition of the United States nine-year-old 
sample, but it turns out there is little cause for alarm. Binkley, Williams, and Haynes (1994)
report selected variables from the grade-four student questionnaire, including parental level 
of education. The results show that 48 percent of the nine-year-olds stated that their father 
had graduated from college, and 44 percent said their mother was a college graduate. These 
figures appear to be much too high, and, if true, would mean that the study vastly
overestimated United States reading proficiency by including a higher percentage of children from
college-educated families than is found in the general student population or the United States 
population as a whole. 

Fortunately, it is unlikely that these figures are correct. The IEA percentages represent 
the self-reports of nine-year-olds, who are likely to be inaccurate in their estimations of their 
parents’ level of education. We know from a similar question asked on the NAEP reading
survey that a large proportion (36 percent) of nine-year-olds state that they don’t know their
parents’ level of education when given the option on a survey to say so. This is the problem
with the IEA data: Students were not allowed to say they didn’t know. They had to answer 
with something or leave the item blank. Binkley et al. (1994) report that only 9 percent of the 
nine-year-olds left the parental education items blank (p. 60), meaning that many of their 
responses were probably guesses. 

One way of checking the accuracy of the nine-year-olds’ answers is to compare them
with the older students’ responses, since older students are more likely to know how much education
their parents have had. The following table reports the responses of United States nine- and 
thirteen-year-olds on the IEA survey and the 1992 NAEP survey regarding parental level of 
education. (The IEA reports father’s and mother’s level of education separately; the NAEP 
survey has only one category: “parental” level. The “father” responses from the IEA results are 
used here, since they are consistently higher than those of the mother.) 

It is clear from looking at the survey answers that failure to include a “don’t know” option 
in the IEA survey of nine-year-olds probably resulted in an inaccurate estimation of parental 
education, but not necessarily a biased sample of students. The “true” number of students who 
are children of college graduates is around 41 percent, as indicated by the NAEP thirteen-year- 
old response. The results for the NAEP seventeen-year-olds are almost identical to the thirteen- 
year-olds, giving us more confidence that eighth-grade student responses are accurate. 
Reported Level of Parents’ Education in the 1991 IEA
and 1992 NAEP Student Questionnaires

                      9-YEAR-OLDS    13-YEAR-OLDS
                      (GRADE 4)      (GRADE 8)

IEA Study*:
  Less than H.S.      10%            (not available)
  High School         24%
  Some College        18%
  College             48%
  I Don’t Know

NAEP Study:
  Less than H.S.       4%             8%
  High School         12%            24%
  Some College         9%            19%
  College             39%            41%
  I Don’t Know        36%             8%

* = Father’s level of education

Source: NCES (1994); Binkley et al. (1994). 


Purves and Elley (1994) confirm that the parental education questions were found in 
an IEA pilot study to be inaccurate for the nine-year-olds in many countries (details of the 
study not given). 

9. Murray and Herrnstein (1992) argue that the SAT declines of the 1960s and 1970s are, in
fact, significant, since while not all college-bound seniors take the exam, all of those going to
our “top” universities take it, and therefore declines are indicative of a serious drop in our
academic “elite.” This sounds persuasive until we consider just how many of the test takers
are likely to go to the top colleges. There are more than 1 million college-bound seniors who take the
SAT. If we (rather generously) estimate that the top fifty colleges and universities each admit
2,000 freshmen, and that all of these students applied to only one college each, that would still
yield only 100,000 test takers, or about 10 percent of the entire pool. The average score, then,
comes largely from those who do not attend the very best colleges and universities, and 
therefore we have no reason to suppose that top students are necessarily responsible for the 
small declines in SAT scores that have been noted. Unfortunately, information on self- 
reported class rank from test takers only became available in 1975, but the scores since that 
time indicate that students in the top fifth of their class experienced no declines (Carson, 
Huelskamp, & Woodall, 1993). Yet Murray and Herrnstein state that scores continued to “free-fall”
from 1975 to 1980 among the country’s best high school students.
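
The estimate earlier in this note can be checked with a few lines of arithmetic. The sketch below is only illustrative: the variable names are hypothetical, and the inputs (fifty colleges, 2,000 freshmen each, a pool of roughly 1 million test takers) are the deliberately generous assumptions stated above, not measured data.

```python
# Rough check of the estimate in note 9. All inputs are the note's own
# deliberately generous assumptions, not measured data.
top_colleges = 50               # assumed number of "top" institutions
freshmen_per_college = 2_000    # assumed freshmen admitted by each one
sat_test_takers = 1_000_000     # approximate size of the SAT pool

# Upper bound: every admitted freshman applied to only one college.
elite_test_takers = top_colleges * freshmen_per_college
elite_share = elite_test_takers / sat_test_takers

print(elite_test_takers)        # 100000
print(f"{elite_share:.0%}")     # 10%
```

Even under these assumptions, roughly nine in ten test takers fall outside the "elite" group, which is why the average score says little about the top students.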

Ironically, the critics who cite the SAT declines ignore the evidence of steady or rising
graduate school admission scores over the past two decades, in the face
of huge increases in the test pool (Berliner & Biddle, 1995). Presumably those who go on to
graduate schools are the most able of those who attended college. The system that Murray and
Herrnstein call the “mediocracy” which has resulted in “educational standards in the college 
track declin[ing] precipitously” (p. 51) has also produced students whose scores on the GRE 
have remained at record high levels. What is perhaps more unusual is that there is no mention 
in Murray and Herrnstein of the stable percentages of students at the highest levels of the one 
national test that actually is representative of all high school seniors, the NAEP. 

10. A few nine-year-olds manage to score above 300 each year, but that figure, too, has held 
stable (1 percent in 1971, 1 percent in 1994). A few thirteen-year-olds have, for the first time
in NAEP history, scored above the 350 level in 1992 and 1994 (1 percent each year), again 
indicating that, if anything, the number of students among the “best and the brightest” is 
higher now than ever before. 

11. A creative analysis of some national test data collected by the United States Department 
of Education in 1972 and 1992 by a journalist with the San Francisco Chronicle (Marshall, 
1997) reached essentially the same conclusions as my own review of the data in previous 
papers (McQuillan, 1996; in press: d), that test scores in California had not declined in recent 
years. Marshall used twelfth-grade reading scores from the National Longitudinal Survey 
(NLS) of 1972 and the National Educational Longitudinal Survey (NELS) of 1992. Although 
the state samples were not drawn randomly from the population, they were large enough 
(over 600 from 40 different California high schools in the NELS sample) to be, at the very 
least, suggestive of statewide trends. Using equated test scores from the two surveys, Marshall 
found that California’s twelfth graders ranked about in the middle of states in 1992 when 
controlling for demographic variables such as parental level of education, income, and native 
language, and experienced no declines when compared to their counterparts in 1972. 

12. Chall (1996) claims this interpretation of the NAEP scores is incorrect because “phonics 
is usually taught in grades 1 and 2, and possibly 3, [and] when it is taught in grade 4, it usually 
means that the students were already functioning below expectancy” (p. 305). In other words, 
low-scoring students in phonics classrooms are there because they are poor students to begin 
with. Chall argues that if we looked at the relationship between phonics and test scores in first 
and second grade, we would see higher scores for the phonics classrooms. 

This interpretation is plausible if we look only at the results nationally by individual
classroom, as they are reported in NCES (1994). It is less persuasive, however, when we
calculate the correlation between a state’s NAEP score and a state’s use of phonics in grade
four, which also turns out to be strongly negative (r = -.59). If Chall (1996) were correct, it would
mean that no meaningful correlation exists between what teachers of a given state do in 
reading instruction in fourth grade and what they do in the first three grades. In this case, it 
would mean that school districts with a strong emphasis on phonics in grade four don’t 
necessarily use phonics in similar proportions in lower grades. 

That is possible, of course, but not very likely. It seems much more probable that, on 
the whole, states, districts, and schools that use a lot of phonics in grade four also use a lot of 
phonics in grades one, two, and three. (Imagine if whole language backers used a similar 
argument in California—that the real problem was that the state’s first, second, and third grade 
teachers were using phonics, and that the emphasis suddenly changed to whole language in 
grade four.) In any case, it may be that teaching methods have less to do with aggregate scores 
at the state level than other factors, such as access to reading materials (see Chapter 7). 

There are other problems with the “blame whole language” argument based on the 
NAEP data. Very little reliable data exists on just how widely literature-based reading 
instruction was actually implemented in the state. The 1992 NAEP reading assessment 
reported that 69 percent of fourth-grade teachers in California said that they placed “heavy 
emphasis” on whole language practices in their classrooms, the highest percentage in the
country. At the same time, 40 percent also said they gave “moderate emphasis” to phonics and 
68 percent said they devoted at least some of their class time to decoding skills (NCES, 1994). 
Fisher and Hiebert (1990), in an analysis of teachers’ self-reported labels concerning reading
instruction and their observed reading practices, often found little correspondence between
what teachers called themselves (“whole language,” “phonics”) and what went on in their
classrooms. We should be cautious, then, in making any strong claims about just how seriously
whole language took root in California based on teacher self-reports. 


Endnotes to Chapter 2 

1. Despite various critiques of the details of Krashen’s theory when first proposed to explain 
second-language acquisition (e.g., McLaughlin, 1978; Gregg, 1984), theorists generally agree 
that the reception of comprehensible input is critical in language acquisition, and that it 
accounts for a major portion of second-language competence (Larsen-Freeman & Long, 1991;
Van Patten, 1987). 

2. I alone, of course, am responsible for the interpretation of Goodman’s and Smith’s ideas 
in this book. 

3. Given a source of comprehensible input, the actual amount of listening or reading may be 
determined by other factors not represented in the model, such as affective variables. Krashen 
(1982) has also hypothesized that the reception of comprehensible input may not be 
sufficient for language acquisition to take place. There are also affective variables, such as self-esteem,
motivation, and anxiety, that may affect the extent of acquisition (Dulay, Burt, &
Krashen, 1982). 

4. It is difficult to imagine, however, a case where at least some elaborative assistance would 
not be needed to help an acquirer understand oral or written messages. 

Endnotes to Chapter 3 

1. It is recognized, however, that children who can read independently can normally also recode, 
so that the ability to do the latter is at least related to the former, even if the relationship is not 
causal. Reading comprehension and recoding ability are clearly not identical, however. Children 
trained in phonemic awareness, for example, make large gains in tests of recoding but only small 
to moderate gains in reading comprehension (Krashen, 1996a; Bradley & Bryant, 1983), as we
note in Chapter 5. Conversely, some adults classified as “dyslexic” show poor recoding ability and
a slightly slower reading rate, but apparently normal passage comprehension (Elbro, Nielsen, &
Petersen, 1994). Stothard and Hulme (1996) also provide evidence that some children who have
difficulties in comprehension have normal recoding skills, and some poor decoders exhibit
normal levels of comprehension. Some second-language readers who have no problems in
reading comprehension have low phonemic awareness (Holm & Dodd, 1997).

Differences exist in how phonological awareness is defined in the vast literature on the 
topic. Stanovich (1992), for example, proposes that we reject the term “awareness” completely 
and instead talk about a phonological continuum from “deep” to “shallow” types of
“sensitivity” (see also Yopp, 1988). A task that required a child to match words based on rhyme 
(e.g., talk and walk) would tap a shallow level of sensitivity, whereas a test that required a child 
to delete or isolate a phoneme in a word (e.g., pronounce “cat” without the “cuh” sound) 
would indicate a deep level of sensitivity. Share (1995) appears to take the position that
phonemic awareness of the “deep” kind is necessary for reading acquisition, and this is the
definition I will use for this chapter. 

2. Smith (1983) notes, however, that the connection between sound and the meaning of words in
oral vocabulary is no less arbitrary than between their written form and meaning, yet clearly 
we have sufficient memory for storing these associations for oral language. 

3. It is interesting to note that many “beginning” children’s books can only be understood by 
pre-readers if they are read to—that is, they are underspecified. The situation is much the 
same for beginning second-language readers, in that there is virtually nothing available 
commercially in print that would fall under our definition of “properly specified.” To the 
extent that this is true for children’s texts, the question of whether there is independently 
available comprehensible written language for early readers is still an open one. 

4. McGee (1986) notes that Jones and Hendrickson (1970, cited in McGee) found that 
environmental print awareness developed before knowledge of print and book handling, 
indicating that it is one of the earliest encounters with text that children have. See also 
discussion of McGee et al. (1988) in text. 

5. Note also that constantly pointing at the text while reading would very likely result in the 
children shifting their attention from meaning to form, and hence may be a quite “unnatural” 
type of exposure to print. 

6. Other studies corroborate the findings of McGee et al. (1988) on the attention to graphic 
detail paid by very young children in identifying words in their environment. Hiebert (1978) 
found that preschool children had an awareness of print that extended to being able to 
distinguish where words began and ended in text, and Harste, Burke, and Woodward (1982) 
provide extensive evidence of pre-readers applying both “holistic” (use of extralinguistic cues) 
and “graphic” (use of letters) strategies in reading environmental print. 

7. Hiebert (1981) conducted longitudinal studies which she later reanalyzed (Hiebert, Cioffi,
& Antonak, 1984) in order to determine an order of reading concept acquisition. Her
reanalysis found that the acquisition of letter name knowledge preceded knowledge of certain
concepts of print. However, Lomax and McGee point out that the particular statistical
procedure Hiebert et al. used to arrive at that ranking involved rather stringent, “pass/fail”
criteria for acquisition that might have missed more incremental changes (1987, p. 241). 

Endnotes to Chapter 4 

1. Cases are also known of children learning to read outside of school, often with little or no 
formal instruction, much later than their age peers. These “late readers” often make rapid 
progress once they begin reading, catching up to their peers within two years of learning how 
to read (Krashen & McQuillan, 1996; McQuillan, in press: b).

2. There are at least three other possible reasons for early reading intervention. First, it is 
argued that there exists a “Matthew Effect” for reading; that is, students who are behind in
reading in first grade will fall further and further behind their age peers as they get older 
(Stanovich, 1986). Shaywitz and his colleagues (1995), however, have recently provided strong 
evidence that poor readers do not become worse relative to other readers as they progress 
through school, and in fact seem to make stronger gains as they get older, even when the data 
are adjusted for possible regression effects. Shaywitz et al. did find, though, that poor readers 
do not “catch up” to good readers, at least in the six years of data they analyzed (grades one to 
six), although they do begin to close the gap. 

Second, early intervention supporters claim that bridging the gap between poor and 
good readers early will lead to eventual success in higher grades. It follows from this claim that 
it is important for children to be reading (or have related “reading skills”) at grade level from
the very beginning of their schooling. The evidence reviewed here on early reading does not 
support the assumption that early intervention is in fact sufficient for long-term reading 
success. Intensive “reading recovery” programs involving one-on-one tutoring in the early 
grades have had some success in affecting reading achievement in later grades, although the 
outcomes are rarely measured after more than a few grades (e.g., Pinnell, Lyons, DeFord, Bryk,
& Seltzer, 1994). It should also be noted that early intervention programs are very costly
(Hiebert, 1994), so the efficiency of the program versus other, less expensive interventions 
must be considered. 

Stanovich (1992) has stated that the initially small gains found in early intervention 
programs that emphasize phonemic awareness training (e.g., Bradley & Bryant, 1983) will
have large, cumulative effects in the long run, countering claims by Yaden (1984; see also 
Krashen, 1996b) that such interventions may not warrant the effort expended on them. 
Unfortunately, the only experimental evidence Stanovich cites is an unpublished paper by 
Bryant in 1987, an apparent follow-up to his 1983 training study with Bradley. 

Third, it is argued that the gap between poor and good readers cannot be bridged (or 
perhaps is more difficult to bridge) in the later grades. Some evidence exists, however, that just 
as early intervention is not sufficient, it may also not be necessary. M.C. Smith (1993) examined 
the reading performance of 87 adults twenty years after they graduated from high school. In a 
multiple regression analysis, he found that reading scores from grades nine and twelve 
accounted for a large portion (79 percent) of the variance in the subjects’ current reading 
proficiency, but the addition of first-grade scores made no significant contribution to the 
variance explained. Smith’s report does not specify whether first-grade reading scores were 
highly correlated with the scores from later grades, which, if true, would make his findings 
about the importance of the high school level reading scores somewhat less convincing. From 
Smith’s presentation of the results, however, it appears as though a series of hierarchical 
regression analyses were conducted to compare the different amounts of variance accounted for 
by different grade levels, so that if the upper and lower grades were highly correlated, this effect 
would have shown up in his results. Smith’s data suggest, then, that reading performance in the 
first grade is not critical for long-term success, and therefore that intervention made in later 
years may be more important than early-grade programs. It is also important to point out that 
the gap between “poor” and “good” readers in some of the longitudinal studies conducted is 
often not very large. In Juel (1988), for example, the poor readers were only slightly below grade 
level upon entry into middle school (3.5 at the end of fourth grade). 

3. The results from a longitudinal study of an early intervention program reported in a recent 
paper (Leslie, Allen, & Calhoon, 1997) suggest that this is indeed what happens. The researchers
measured the fourth-grade reading achievement of a group of poor readers who received 
tutorial help in grades one through three. While all of the children were reading at grade level 
when they exited the early intervention program, almost half of them (47 percent) were below 
grade level again one to three years later. Early gains were not sustained. The results of this 
paper must be interpreted with caution, however, as a full report on the project has yet to be 
published. It is interesting to note that the intervention the children received in Leslie et al. 
(1997) consisted partly of asking the parents to read to their children, and sending home three 
to five books with the child after each tutoring session. A correlational analysis of third-grade 
reading scores and a variety of early literacy measures and intervention components found 
that these parental read-alouds were by far the strongest predictor of success (r = .63). This is 
consistent with the notion that elaborative assistance and book access are the most critical 
factors in reading achievement. A measure of phonological knowledge (rhyming ability), 
however, was not significantly correlated with later reading achievement (r = .28). It is not 
known what sort of book access the children had once they exited the program and reached
fourth grade, which may help explain their failure to remain at grade level in reading. 


Endnotes for Chapter 5 

1. Smith gives a good example of how reading is fundamentally “orthographic”—that is, a 
process of extracting meaning directly from the printed page without recourse to sound. 
Consider the following sentence: 

The none tolled hymn she had scene a pare of bear feat inn hour rheum. 

(Smith, 1973, p. 72) 


You can very easily detect the misspelled words in this sentence, a good indication that we get 
meaning mostly from the visual properties of words, not their sound. If sound were the 
primary way of extracting meaning from text, there would be no reason to think anything was 
wrong with this sentence. 

It has been said by some researchers (e.g., Horn & Manis, 1985; Stanovich, 1982) that 
good readers do not differ from poor readers in “orthographic processing,” that is, the use of
writing without making recourse to sound, or “phonological processing.” However, Massaro 
and Sanocki (1993) point out that studies designed to measure orthographic information by 
breaking it down into smaller units, such as individual letters, as is typically done, do not give 
an accurate picture of the way orthographic knowledge is used in real reading, and hence will 
fail to detect potential differences. Barker et al. (1992) found that certain tasks tapping 
orthographic knowledge were much more strongly related to reading in elementary school 
children than the phonological tasks they used (phoneme deletion and pronouncing 
nonsense words). 

2. See also Goodman (1996), Krashen (1998a), Moustafa (1997), Smith (1994), and Weaver 
(1994), for a discussion of many of these same issues. 

3. It is also unclear just how well “concept of print” tests really measure print exposure. 
Morrow, O’Connor, and Smith (1990) found that kindergartners in traditional reading
readiness and literature-based programs did not differ on the posttest results from one such
test, Clay’s Concepts about Print, but were significantly different on other measures of print
exposure, including the amount of reading they did at home, the number of authors and 
book titles they could name, and the types of different reading material they could identify. 
These differences had an impact on literacy development: The literature-based classes did
significantly better on two comprehension measures (story retelling and probed recall), as 
well as on their attempted readings of storybooks. The reading readiness groups did not, 
however, do significantly better on “skills” tests measuring auditory and visual 
discrimination and letter identification despite their additional training on these areas. 


Endnotes to Chapter 6 

1. It has been claimed by many researchers that Smith’s position was that we actually
“skipped” over letters and words, which I believe is not a correct interpretation of his position 
(see Smith, 1973, pp. 56-58). Smith noted that “words can be identified with only half the 
featural information that would be required if prior letter identification were necessary, 
provided that features sampled are taken from different locations within the configuration”
(p. 57); and “there will be many alternative ‘critical sets’ of about 15 features per word,
provided the features are a sampling from all the locations [in the word]” (p. 57). There is no
implication that letters will necessarily be skipped over, as subletter features may be sampled 
from the entire array of the visual stimulus—that is, possibly from all the letters of the word. 

2. Identification of individual words may take place after the eye has scanned across them 
and after we identify meaning, as is the case with individual letters in a word. See Smith 
(1994) for a more thorough treatment of this topic. 

3. Of course, as reading proficiency increases, both decoding ability and context facilitation 
may lead to fewer errors of all types for both good and poor readers on a relatively easy 
passage. The skilled readers in Stanovich et al. (1984) scored at a grade equivalent of 2.9 in 
the spring, while the text had a readability level of grade one. As expected, skilled readers 
made virtually no errors on the passage when tested in the spring (0.8 errors on the coherent 
paragraph). Good readers will be both more accurate and better users of context than poor 
readers when the situation warrants. 

4. Smith (1994), commenting on the context studies by Stanovich and others, notes that the 
entire argument may be misplaced: His own conception of “context” extends far beyond the 
words on the page and is mostly that which is in the reader’s mind, the “nonvisual” 
information we possess. 

5. One vote given for each measure included in Chall’s Tables 4-1A and 4-1B (combining first
and second grades). I have adjusted the count to reflect the errors Carbo (1988) pointed out 
in Chall’s analysis. Chall (1989), in responding to Carbo, did not contest the errors in her 
analysis of these particular cases. 

6. Morrow conducted another literature-based study at the kindergarten level that is not 
included here, since the children had not yet begun to read independently (Morrow, 
O’Connor, & Smith, 1990). It is interesting to note, however, that those exposed to more print 
did much better in story comprehension as well as the amount and types of reading they did 
at home, demonstrating the effects of print exposure even before independent reading. 
Morrow et al. compared the effects of extensive print exposure through storybook reading
in eight “at-risk” kindergarten classrooms. Four of the classes engaged in traditional reading 
readiness activities, quiet book reading, read-alouds, retellings and repeated readings of 
stories, and other literature-related activities. The four control classes had significantly less 
exposure to print, and instead used a “reading readiness” program that focussed on letters of 
the alphabet as well as some storybook reading. The researchers made weekly observations 
of the classrooms to ensure treatment fidelity of the literature-based component, which lasted 
about 60 minutes per day. Two tests of comprehension were administered (story retelling and 
probed recall), on which the experimental groups did significantly better. The children in the 
literature-based classes were also able to name more book titles and authors, and reported a 
greater amount of reading at home (pp. 266-267). 

7. In addition to increasing student access to books, the programs that Gambrell (1996) 
evaluated also used some incentives (bookmarks and books) to motivate the children to read 
more. It is not clear that these incentives were critical to the programs’ success, however, since 
there was no control group that received increased access to books but no rewards. In 
previous studies, access to books alone has been found effective in increasing reading 
frequency (Krashen, 1993); incentives alone have not (McQuillan, 1997b). Incentives have not 
been found to increase motivation in other academic areas (Kohn, 1993), and in fact can have 
a negative impact on both motivation and achievement (Deci & Ryan, 1992).

Endnotes for Chapter 7

1. One apparent counterexample to the experimental evidence on the positive effects of more 
reading and book access is Carver and Leibert (1995), whose subjects did not seem to improve 
in reading after a seven-week summer “free reading” session. The study had numerous flaws,
however, that make the results completely understandable, even predictable. The most serious 
error was that students were not allowed to read any books even slightly above their 
predetermined grade level! This restriction itself almost guaranteed that no progress in 
reading comprehension or vocabulary would be made. In addition, the students had a very 
small number of books to choose from, again limiting their ability to read voluminously 
enough to benefit from the time spent in the program. 

2. This approach was recommended to me by Stephen Krashen. 

3. Since a state’s mean SAT score is very strongly and negatively related to the percentage of 
total high school students taking the exam (r = -.86), this percentage was used as a weighting 
variable in the analysis to control for this source of bias in the scores. 

4. It may be somewhat surprising to some that school expenditures did not directly predict 
reading scores among high school students. The situation is more complicated than what is 
represented in this analysis. Lance et al. (1993), in a more sophisticated path analysis of access 
and reading at the district level in Colorado, found that spending did affect reading 
achievement insofar as it had an impact on the collections of the school library. A more
detailed examination of the relationship between total spending and library expenditures is 
needed to confirm this finding at the national level. Similarly, class size as determined by 
teacher-student ratio did not by itself predict high school reading achievement. Again, Lance 
et al. found that such a relationship was mediated by the school library: Teacher-student ratio 
was related to school library-media center size, which in turn predicted academic 
achievement. The significant correlation in the current study between teacher-student ratio 
and books per pupil supports Lance et al.’s finding, in that the larger the average classroom, 
the fewer the number of books per student in the school library. 

5. Martinez and her colleagues (Martinez, Roser, Worthy, Strecker, & Gough, 1997) point out 
that the notion of “access” should include not merely the number of books physically present 
in a given location, but the readability and interest of the texts themselves. In their study of 
three low-socioeconomic-status classroom libraries, they found that many of the books in the 
classroom were not read (or likely to be read) by the children precisely because they were too 
difficult for them. They also noted that many others were “dated, worn... and inappropriate 
in other ways” (p. 271). 

6. I will speculate that there is one sense in which the “try harder” solution is correct and can 
even be demonstrated by cases of those who have “pulled themselves up by their own 
bootstraps.” This is because if you lack the proper resources, extraordinary measures are the 
only way to succeed (although they still may be insufficient). It is thus not surprising to find 
media stories about the occasional student who “makes it” while attending a poorly funded school,
and (rightly) points to his or her own heroic efforts and determination. How else could he or 
she have succeeded? Rather than make the case that effort is the essential ingredient, these 
cases demonstrate how difficult such a feat really is compared to that routinely performed by 
those who come from more advantaged backgrounds. Applying such Herculean efforts, then, 
is not the path taken by the typical child of most middle- and upper-income families. As the 
old joke goes, we shouldn’t confuse being born on third base with hitting a triple (thanks to 
Stephen Krashen for this last observation). 